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Abstract 

Consider a system X — ((x^(t)),^ € fi./v)t>o of interacting Fleming- Viot diffusions with 
mutation and selection which is a strong Markov process with continuous paths and state space 
(V(T)) nN , where I is the type space, Qn the geographic space is assumed to be a countable 
group and V denotes the probability measures. 

We establish various duality relations for this process. These dualities are function- valued 
processes which are driven by a coalescing-branching random walk, that is, an evolving particle 
system which in addition exhibits certain changes in the function-valued part at jump times 
driven by mutation. 

In the case of a finite type space I we construct a set- valued dual process, which is a 
Markov jump process, which is very suitable to prove ergodic theorems which we do here. The 
set-valued duality contains as special case a duality relation for any finite state Markov chain. 

In the finitely many types case there is also a further tableau-valued dual which can be 
used to study the invasion of fitter types after rare mutation. This is carried out in |DGsel| 
and |DGInvasion| . 
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Background and motivation 

We develop here an extensive duality theory for interacting Fleming- Viot processes with selection 
and mutation, which we shall exploit further in |DGsel| and [DGInvasion] . 

Consider the following spatial multitype population model. The state of a single colony is 
described by a probability measure on some countable type space, the geographic space is modelled 
by a set of colonies and the colonics (or domes) are labelled with the countable hierarchical group 
£In or some other countable Abelian group. The stochastic dynamics is given by a system of 
interacting measure- valued diffusions and the driving mechanisms include resampling (pure genetic 
drift) as diffusion term, migration, selection and mutation as drifttcrms. Resampling is modelled 
in each colony by the usual Fleming- Viot diffusion. The haploid selection is based on a fitness 
function on the space of types and the mutation is type- dependent. The model belongs to a class 
of processes which have been constructed via a well-posed martingale problem by Dawson and 
Greven |DG99j . 

More precisely the model we use arises from the particle model driven by migration of particles 
between colonies, and in each site by resampling of types, mutation and selection. Increase the 
number of particles as e~ 1 and give them mass e. Then as e —> 0, a diffusion limit of interacting 
multitype diffusions results if the resampling rate is proportional to the number of pairs and there is 
weak selection, i.e. selection occurs at a rate decreasing with the inverse of the number of particles 
per site. Otherwise with strong selection, i.e. selection at a fixed rate, we get the deterministic 
limit, often referred to as the infinite population limit. 

An important technical tool in the analysis of Fleming- Viot processes are representations of 
the marginal distributions of the basic processes in terms of expectations under the appropriate 
function-valued, respectively set-valued, dual processes. 

Recall that two Markov processes (Z(t))t>o, (Z'(t))t>o on Polish spaces E, respectively E' are 
called dual w.r.t. to the duality function H : E X E' — »• M, if for (deterministic) initial states Zq 
respectively Zq for Z respectively Z', the following identity holds: 

(0.1) E[H(Z(t),Z' a )]=E[H(Z Q ,Z'(t))] , V Z Q e E, Z' Q e E' . 

This often allows us to draw conclusions on the process Z from a (simpler) process Z' on E' . 

It has been realized for a long time that duals play an important role in the analysis of interacting 
particle systems |Lig85| and for measure- valued diffusions [D] and many results are obtained via 
this technique but for population genetics models this is of particular importance. See also [EK2 
for a very general approach. 

Some (but not all) of these duality relations can in the case of population models be interpreted 
in terms of the genealogical tree of a tagged sample and this will be the case here. Very often for a 
given process various different dual processes are available using different spaces E' and different 
duality functions H. Depending on the application the use of different duals is in fact sometimes 
necessary. 

Dual process representations for Wright-Fisher processes with selection and migration were first 
introduced by Shiga and Uchiyama in jS2j , jSl] and jSUj . These dual representations were extended 
in Dawson and Greven jDG99j to spatial models with arbitrary type space, namely interacting 
Fleming- Viot diffusion with selection and mutation and lead to a Feynman-Kac duality. The 
latter can be used to establish that the martingale problem we formulated below in (|1.24|) is indeed 
well-posed and to derive results on the longtime behaviour for sufficiently large state- independent 
mutation. Duals for particle models of populations with selection have been introduced first by 
Krone and Neuhauser in |KN97j . An interesting class of duality relations is studied by Athreya 
and Swart in |ASj . 

However in order to study the long-time behaviour in general we have to go beyond the 
Feynman-Kac duality in |DG99j and we have to develop finer dual representations for the hi- 
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erarchically interacting system on Sljv (or any other geographical space). It is often also useful 
to consider the nonlinear Markov process (McKean-Vlasov process) arising as the mean-field limit 
N — >• oo of an N site exchangeable model and also for this case one obtained a duality. 
The duality we shall introduce works for 

• multitype selection, 

• state- dependent mutation and 

• has a historical and genealogical interpretation and in fact extension to tree-valued processes 
(see [DGP] 1. 

• it can also be extended to multiple species and multilevel models (see jD2011j ). 

The goal of the dual construction is to construct for a sample of n-individuals from the popu- 
lation at a given time t the probability law of the collection 

(0.2) {(type of individual i, genealogical distance of pairs of individuals (k, £)) , V i, k,t € 1} . 

The key point for the analysis is that we have a function-valued dual process, that is, in addition 
to a dual particle process we have a process which is function- valued but driven by a particle system. 
In the case of a finite type space we derive a refined dual which takes values in sums of products 
of indicators respectively a set-valued dual. Both these duals are very powerful analysing the 
longtime behaviour. 

In case of two-type and state-dependent mutation our dual is a relative of the ancestral selection 
graph of Krone and Neuhauser |KN97j , but note that the latter does not have the form of a duality 
with another backward Markov process. 

1 The model 

We now introduce the system of interacting Fleming- Viot diffusions with mutation and selection, 
where the interaction is due to migration between the colonies indexed by the geographic space. 

1.1 Ingredients 

First we introduce the state space of the process and the basic parameters of the stochastic evolu- 
tion. 

(a) The state space of a single component (describing frequencies of types) will be 

(1.1) V(T) = set of probability measures on I. 

The set of colonies (sites or components) will be indexed by a set Q,n, which is countable and 
specified in (/?) below. The state space X of the system is therefore 

(1.2) X = (T(I)) nN , 

with the product topology of the weak topology of probability measures on the compact discrete 
set X. A typical clement is written 

(1.3) X = {x e Uen N with x e E P(I). 

(/3) The hierarchical group fijv indexing the colonies of the geographic space is defined by: 

(1.4) n N = {e = (e) ieNo \e g z, o < e < n - 1, 3k ■. ? = o vj > k }, 
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with group operation denned as component-wise addition modulo N. A typical clement of f2jv is 
denoted by £ = £ x , . . .). Here N is a parameter with values in {2, 3, 4, ...}. 
Note that: 

OO 

(1.5) ftjv=@Zjv, ^ = {0,---,iV- 1} with addition mod (N). 

i=0 

We also introduce a metric (actually an ultrametric) on H^v denoted by d{-, •) and defined as 

(1.6) dfo?) =inf{fc|f = (£') j Vj>fe}. 

The use of fi^ in population genetics in the mathematical literature goes back to Sawyer and 
Felsenstein |SFj . An element , £ 2 , . . .) of £7jv can be thought of as the £°-th village in the 

f^-th county in the £ 2 -th state .... In other words we think of colonies as grouped and classified 
according to "extent of neighborhood inclusion" . 

Everything we state here remains valid if we replace fi^v by a countable group like Z d for 
example. 

(7) The transition kernel a(-, ■) on Ojv x fijv, modelling migration rates has some specific prop- 
erties. We shall only consider homogeneous transition kernels: 

(1.7) a w (£,O = M0,r-0 V^'efiiv 

Example 1 Choose the hierarchical group with a multi-level symmetry property: 

(1.8) aN (0,0=J2(w^T))^ ifd(0,0=3>l, 

k>j 

where 

00 

(1.9) c k > V k e N, JFk < 00 for all N > 2. 

k=0 

The kernel oat(-, •) should be thought of as follows. With rate Ck-i/N < - k ^ 1%> we choose a hierarchical 
distance k, and then each point within distance at most k is picked with equal probability as the 
new location. Then il.9\) requires a finite jump rate from a given point. 

It is often convenient to write the transition rates as 

(1.10) cajv(-, •) , c £ M + and probability transition kernel on I x I. 

Then we can say that c is the migration rate and aAr(£, £') the probability that a jump from £ to 
£' occurs. 

(5) In addition to describe mutation and selection we need two further objects. Let 

(1.11) M(-, •) be a probability transition kernel on I x I, 

modelling mutation probabilities from one type to another. Furthermore let 

(1.12) x(0 be a bounded function on I, < x(-) < 1, = min x> 1 = SU PX; 

modelling relative fitness of the different types. We have set here the arbitrary minimum value 
of x equal to zero and then in order to uniquely specify the parameter s (representing selective 
intensity) we have set the maximal value of \ equal to one. Using x we can embed I into [0, 1] 
rather naturally and use on it the relative topology induced by the euclidian topology on [0,1]. 
The case we use in [DGselj is as follows: 
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Example 2 We consider a hierarchically structured set of types which allows to model a fitness 
function acting on a whole hierarchy of scales. We set: 

(1.13) I = (No X {1, 2, ... , M}) U (0, 0) U (oo, 1), 

where Nq = {0} U N and M is an integer satisfying 1 < M < N. 

Adding the maximal point (oo, 1) allows us to work with a compact set of types and as a 
consequence a compact Polish state space if we embed the type set in the interval [0, 1] appropriately 
using the relative topology. 

Then we can write: 

oo 

(1.14) I=U^, 

3=0 

where 



(1.15) Ej = {(J, £);£= M}, j € N, E = {(0,£), I = 0, M},E^ = {(oo,l)}. 



1.2 Characterization of the process by a martingale problem 

We now proceed to rigorously specify the model by formulating the appropriate martingale problem. 

The key ingredients for the martingale problem with state space E (Polish space) are a measure- 
determining sub-algebra A of Cb(E,M), the socalled test functions, typically called F here, and a 
linear operator on Cb(E,M), typically called L, with domain A resulting in values in Cb(E,M.). 

Definition 1.1 (Martingale problem) 

(a) The law P on a space of E-valued path for a Polish space E, either D([0, oo), E) or 
C([0,oo), E), is a solution to the martingale problem for (L,v) w.r.t. A if and only if 

(1.16) lF(X(i))-f(LF)(X(s))ds\ is a martingale under P for all F e A 
V o J t >o 

and 

(1.17) C(X(0)) = v. 

The martingale problem is called wellposed, if the finite dimensional distributions of P are uniquely 
determined by the property il.lu}) and \1. 17ty . 

(b) In our context E = (V(l)) nN . □ 

The algebra of test functions we use here is denoted by 

(1.18) AC C((V(I)) nN ,R) 

and is defined as follows. Given a nonncgative bounded function / on (I) k and £i,...,£k G fijv 
consider the function on (.M(I)) N (here M. denotes finite measures) defined by: 



(1.19) F(x) = J ■ J f(ui,---,u k )x^ 1 (du 1 )...x^ h (duk), 

i i 

&eOjv, x s eM(l), xe(M(l)f N . 

Let A be the algebra of functions generated by functions F of type (|1.19[) restricted to (V(T)) flr ' 
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Our operators L will be differential operators. We define differentiation on the big space 
(A4(I)) N , even though we deal only with restrictions to (P(T)) nN later on. This approach fa- 
cilitates comparison arguments. Hence we differentiate the function F as follows: 

dF , u i ,. F(x £ £' u ) - Fix) 
(1.20) — (*)[«] = hm-i i— U, 

with 

(1.2D *^=kw <={5 +e ^ 



Correspondingly Jq (x)[u,v] is defined as -g- [§§-(x)[u]) {x)[v\. 



Let 

(1.22) c,s,m,d>Q 

be parameters that represent the relative rates of the migration, selection and mutation, respec- 
tively the resampling rate which in biological language represents the inverse effective population 
size, finally let ajv(-, ■) be as in (|1.7[) . Furthermore x an d M are as in p. lip and (| 1 . 1 2[) . 

We now define a linear operator on Cb((V(T)) N ) with domain A, more precisely 

(1.23) L : A — ► Cb(('P(I))^ N , R), 

which is the generator L of the martingale problem and is modeling migration, selection, mutation 
via drift terms and resampling as the stochastic term (in this order), by (here d > and c, s, m > 0): 



(1.24) (LF)(x) =E ?6 n, 



J £ 

-(u) (x(«) - / x(w)x$(dw) 




( v )M( U ,^)-^^(u 
axe 




(u,v)Q x Jdu,dv) 



:r 6 (P(I)) J 



where 

(1.25) Q x (du,dv) = x(du)5 u (dv) — x(du)x(dv). 

It has been proved in |DG99[ 99] that a model of the type as above is well defined: 



Theorem 1 (Existence and Uniqueness) 

Let v be a probability measure on (P(I)) N specifying the initial state which is independent of the 
evolution. 

(a) Then the [L\v) -martingale problem w.r.t. A, on the space C([0,oo) (cf. Definition ] 
(■p(I)) 0jY ) is well-posed. For fixed value of the parameter N the resulting canonical stochastic 
process is denoted 

(1.26) (X t N ) t > . 

(b ) The solution defines a strong Markov process with the Feller property. □ 
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In the sequel we shall consider often initial states which are either deterministic or random with 
law v in V((V(T)) nN ) which satisfies the following two properties: 

(1.27) v is invariant w.r.t. spatial shift on Ojv, 

(1.28) v is shift ergodic. 

2 Function- valued dual 

In this section we first recall the Feynman-Kac duality from |DG99j , next we develop the new dual 
representations for the interacting system. This allows us in particular to establish the ergodic 
theorem for the mean-field limit with state- independent mutation component. 

We then develop a further modified dual that covers state- dependent mutation in order to 
prove for example ergodic theorems or results describing the transition from one quasi-equilibrium 
to the next (see [DGselj ). This latter dual representation is very flexible and allows for various 
refinements useful for specific purposes. In particular the multi-scale analysis requires some new 
arguments due to the interplay between selection and mutation. Each of these arguments requires 
some specific features of the dual representation which requires some slight modifications of the 
dual mechanism that arc developed here in this section systematically. 

In |GLW| and |GPWmp] it is shown that for the neutral model with migration the duality for the 
marginal distribution arises from a dual representation of the genealogical process of the underlying 
particle process. Roughly speaking the dual dynamics generates the law of the genealogical tree 
of a randomly sampled finite tagged population among the total population at a given time t in 
terms of the genealogical tree associated with a coalescent. This raises the question to what extent 
this can be extended to processes with mutation and selection. This remains to be investigated 
and will be discussed later on in Subsection 12.41 On the level of genealogies such a dual is used in 
[PUP] . 

Outline of Section [2] We begin in Subsection 12. II to introduce the duality functions and the 
dual process and give in Subsection 12.21 the basic duality relations on which one can base all the 
modifications and refinements we present later on. In Subsection 12.31 we give special versions of 
the dual representation of the mutation useful for the different possible applications of the dual we 
have a need for. In Subsection 12.41 we discuss the historical meaning of the dual representation. 
In Subsection 12.51 we derive a very important refinement of the duality relation for models with 
finitely many types. This allows also to obtain a historical interpretation of the dual which we 
discuss in Subsubsection |2~5.4I 

2.1 The dual process and duality function for the interacting system 

In this section we introduce the basic ingredients of the duality theory. In order to obtain a 
dual representation of a (V (I) ) nN -valued process (X t )t>o we need two ingredients, (1) a family 
of functions on the state space that is measure-determining from which we construct the duality 
function H{-, •) and (2) the stochastic dynamics of a dual process whose state space is the set which 
labels the family in the previous point. 
(1) Duality functions 

We begin with the first point. Recall that in order to determine the marginal distribution at 
time t of the law P Xo of the system X{t) = (x ( (t))^ e a N , with initial point X(0) = X E (T(T)) nN , 
it suffices to calculate for every fixed time t a suitable form of mixed moments, i.e. we have to 
determine the following quantities parametrized by (£cb /) (the initial state of the intended dual 
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dynamics) with £o a vector of geographic positions and a basic function / on finite products of the 
type space I: 



(2.1) F t (fa,f),X ) 



E 



/(til, ■ . . ,U n )x io (i)(t)(dUi) . . .X£ ( n )(t)(dUn) 



for all n £ N, Co = (£o(l), • • • , £o(n)) 6 (fij\r) n and / e L oa ((I) n ,R). This means we want to 
consider the bivariatc function 



(2.2) H(X, (£, /))= / ■ • ■ ,u„)a; c(1) (dui) ■ • ■ x i{n) )(du n ) 

i" 

to obtain a duality. This would however only allow for a dual for the migration part. Namely 
introducing the dual process we shall see that we have to enrich the second argument further, to 
be able to obtain a Markov process as a dual, namely resampling and selection require to introduce 
partitions of the variables. 

The class given above contains the functions which we will use for our dual representations. In 
fact it suffices to take smaller, more convenient sets of test functions /: 



(2.3) f(ui,...,u n )=Y[fi( Ui ), /ieioo(I). 



Since in our case I is countable, it would even suffice to take functions 
(2.4) / 3 -(«) = l 3 -(«) or /,•(«) = l Ai («), 

where lj is the indicator function of j 6 I and in case of finite type sets, 1a with A some sets of 
types for example like {u\x(u) _! efc} if = eo < • • • < eg = 1 are the fitness values, which will be 
a very useful choice later on. 



Remark 1 (Notational convention). 
In this section we write 

(2.5) X t instead of X(t) 

on the level of the collection (but we still write x^(t) for components). 

(2) Dual process: Spatial coalescent with births driving a function-valued process 

Now we come to the second point, the dual process on a suitable state space. We observe here 
that due to the interaction of selection and mutation the state space of the dual process is more 
complicated than for most particle systems or interacting diffusions. We proceed in four steps. 

Step 1 (The state of the dual process) 

We begin this point by reviewing the dual process used in |DG99j which involves a function- 
valued process (J t ) t >o driven by a finite ordered particle system (rjt)t>o- Indeed in |DG99j the 
quantities on the r.h.s. of (J2.1[) were expressed in terms of the evolution semigroup corresponding 
to a 

(2.6) bivariate Markovian stochastic process (n t ,Tt)t>o 
consisting of 
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• a process rj involving a random set of finitely many ordered individuals, 

• a function- valued process J 7 . 

This is the dual process to be discussed in the sequel where r\t takes values in the set of marked 
partitions of a random set of individuals and Tt is a junction defined on products of the type set 
I. The joint evolution of these is such that the dynamic of J- t is driven by the process rjt which 
evolves autonomously. 

We will use two main types of such processes which are denoted by 

(2.7) (riuFt), respectively (vt,^) or its variant (n ti gf) t > , 

where the first component, r\ t is a particle system described formally below and the second com- 
ponent J r t,J 7 t', takes values in the space of functions on I N which are measureable, bounded and 
depend only on finitely many components which we can identify with 

oo oc 

(2.8) |J ^((IT), resp. (J ^(00™)- 

m— 1 m— 1 

This process, called the function-valued dual process, will allow us in Subsections 12.21 and 12.31 
to represent for every given Xq the function F t from (|2.1[) as the expectation of an appropriate 
functional over the evolution of (7/4, J^) or (r/ t , J 7 ^) respectively (rjt, J 7 ^)- We construct it here in 
such a way (for example its order structure) that it can be refined later better for some special 
applications. 

The process (j]t, J~t)t>o is constructed from the following four ingredients: 

• N t is a non-decreasing N-valucd process. N t is the number of individuals present in the dual 
process with N Q = n, the number of initially tagged individuals, and with N t — N a > given 
by the number of individuals born during the interval (0,i\. 

• £ t = {1, • • ■ , Nt} is an ordered particle system where the individuals are given an assigned 
order and the remaining particles are ordered by time of birth. 

r h = (Cti n t-,Ct) is a trivariate process consisting of the above (~ and 

— 7r t : partition [itj, ■ ■ ■ ,7r t ) of £t, i.e. an ordered family of subsets, where the 

index of a partition element is the smallest element (in the ordering of individuals) 

of the partition element. 
— £t : Tt — ► ^!v*') giving locations of the partition elements. Here £0 is the vector 

of the prescribed space points where the initially tagged particles sit. 

• J-t is for given rj t = (Ct,nt,£t) is a function in L 00 (ll 7r *l). 

Therefore the state of r\ is an element in the set 

00 

(2.9) S = (J {1, 2, • • • ,m} x Part<({l, ■ • • ,m}) x {£ : Part<({l, • ■ • , m}) — > Ci N }, 

where Part < (A) denotes the set of ordered partitions of a set A C N and the set {1, • • • , m} is 
equipped with the natural order. 

It is often convenient to order the partition elements according to their indices 

(2.10) index (71^) = min(k G N : k G 7r|) 
and then assign them ordered 

(2.11) labels 1,2,3, •••,|tt 4 |. 
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Remark 2 Note that 

(2.12) N t = | 7 r t 1 | + --- + | 7 rl" l |. 

Remark 3 T7ie process rj can actually be defined starting with countably many individuals lo- 
cated in f2iv- This is due to the quadratic death rate mechanism at each site which implies that the 
number of individuals at any site will have jumped down into N by time t for any t > 0. 

Remark 4 We view the configuration of(n s ) s < t as the description of the genealogy of the sample 
drawn at time t observed back at time t — s. We interpret Nq = n as the sample size at time t and 
N s as the population at time t — s which can potentially influence the type of the individuals in 
the sample at time t. Then (£ s )s<t give the ancestral path of the individuals which are potentially 
relevant for the n-sample at time t. The partition ir s describes the groups of individuals at time t 
which have a common ancestor at time t — s. See Subsection \2.J\ and Subsubsection for more 
information. 



Remark 5 We can now formulate our dual process into the classical scheme of duality theory 
(see \0.1\) . We write X' for the pair (77, J 7 ), 

00 

(2.13) E= (V(l)f N , E' = {X' eSx(\J L 00 (I m ))\m=\n\}. 

m— 1 

If we use the standard topology on the set E, it is a Polish space. 

E' can be embedded in a complete metric space which is a subset of S based on ordered partitions 
of N (rather than finite sets) and the set of functions in L oc (I N ) which can be approximated by 
functions of finitely many variables. If we want to achieve separability of E' one needs to assume 
more on x and H to be able to restrict the function space to continuous functions on I. However 
one should note that the measures appearing as states of X are always at most countably supported 
for positive time. 

Then define a map H : E x E' R by 

(2.14) H(X, (77,^")) = J ■■■ J T{u\, ■ ■ ■ ,u\ n \)x^ t )(dui) ■ "^(KD^m)- 

1 1 

Then the collections of bounded measureable functions 

(2.15) {H(;(n 7 E)),(n 7 E)eE'} 7 ({H(X,-) 7 X E (V(I)f»}) 

are measure- determining on (E, 13(E)) respectively (E' ,B(E')). The second property is not always 
satisfied in the duality theory and typically not needed in applications. Then H(-, ■) is called a 
duality function for the pair (E, E') and E and E' are legitimate state spaces for Markov processes, 
recall Remark^ (See fLig85j and \EK2^ for detailed expositions of duality theory). 

We describe the dynamics of the process in two steps, first we give the autonomous dynamics 
of (r]t)t>o and then construct (J r t)t>o respectively (J r t + )t>o or (Gt~)t>o given a realization of the 
process rj. In subsequent sections we introduce various modifications of this tailored for specific 
purposes. 

Step 2 (Dynamics of dual particle system) 

The dynamics of {rj t } is that of a pure Markov jump process with the following transition 
mechanisms which correspond to resampling, migration and selection in the original model (in this 
order): 
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• Coalescence of partition elements: any pair of partition elements which are at the same site 
coalesce at a constant rate; the resulting enlarged partition element is given the smaller of 
the indices of the coalescing elements as index and the remaining elements are reordered to 
preserve their original order. 

• Migration of partition elements: for j G {1, 2, . . . , |7r t |}, the partition element j can migrate 
after an exponential waiting time to another site in £1^ which means we have a jump of Ct(J)- 

• Birth of new individuals by each partition element after an exponential waiting time: if there 
are currently N t individuals then a newly born individual is given index Nt- + 1 and it forms 
a partition element consisting of one individual and this partition element is given a label 
\irt—\ + 1- Its location mark is the same as that of its parent individual at the time of birth, 

• Independence: all the transitions and waiting times described in the previous points occur 
independently of each other. 

Thus the process (r]t)t>o has the form 

(2.16) m = (Ct, 7r t ,(6(i),6(2),---,6(kl)))> 

where G S7jv, j = 1 , . . . , 1 7r* | , and n t is an ordered partition (ordered tuple of subsets) of 

the current basic set of the form {1, 2, . . . , Nt} where N t also grows as a random process, more 
precisely as a pure birth process. 

We order partition elements by their smallest elements and then label them by 1,2,3,... in 
increasing order. Every partition element (i.e. the one with label i) has at every time a location 
in Q at, namely the i-th partition element has location £t(i). (The interpretation is that we have 
N t individuals which are grouped in |7r t |-subsets and each of these subsets has a position in fijv). 

Furthermore denote by 

(2.17) n t (l),-'-M\*t\) 

the index of the first, second etc. partition element. In other words the map gives the index of a 
partition element (the smallest individual number it contains) of a label which specifics its current 
rank in the order. 

For our concrete situation we need to specify in addition the parameters appearing in the above 
description, this means that we define the dual as follows. 

Definition 2.1 (First component of the dual process: rj t ) 
(a) The initial state t]q is of the form (Coj^Oj^o) with: 

(2.18) Co = {1,2, ••',"}, = {{!}>•••,{»}} 

(2.19) Co g (n N y\ 

(b) The evolution of (rjt) is defined as follows: 

(i) each pair of partition elements which occupy the same site in Qn coalesces during the joint 
occupancy of a location into one partition element after a rate d exponential waiting time, 

(ii) every partition element performs, independent of the other partition elements, a continuous 
time random walk on f2jy with transition rates ca(-, •) (see Jj.Tp J, with &(£,£') = &(£' ',£), 

(Hi) after a rate s exponential waiting time each partition element gives birth to a new particle 
which forms a new (single particle) partition element at its location, and this new partition 
element is given as label \irt— \ + 1 an d the new particle the index Nt- + 1. 
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All the above exponential waiting times are independent of each other. □ 

Note that this process is well-defined since the total number of partition elements is stochastically 
dominated by an ordinary linear birth process which is well-defined for all times. 

Step 3 (Dynamics of function-valued part of dual process: J- t , J-^~ ,Gt~ ) 

To complete the specification of the dual dynamics we want to define a bivariate process (rj t , Ft) 
resp. (n t , J" t + ) associated with (|1.24[) . Hence we now describe the evolution of the process(J 7 t )t>o 
conditioned on a realization of the process (rjt)t>o- For a given path of the first component (r?t)t>o 
the second component is a function-valued process (J-t)t>o, respectively (J-" t + )t>o, (Gt~)t>o with 
values in (J?° ^oo(I m ), respectively L+ (IT). 

The evolution of J~t, respectively J-" t + , starts in the state 

(2.20) Jo = J + = /> 

with a bounded function / of |7To (-variables, a parameter we denoted by n, the variables running 
in I. 

The evolution of (J-t)t>o involves three mechanisms, corresponding to the resampling, the se- 
lection mechanism and the mutation mechanism. We now describe separately these mechanisms. 

Definition 2.2 (Conditioned evolution of J-,J- + : the coalescence mechanism) 

If a coalescence of two partition elements occurs, then the corresponding variables of T t are set 
equal to the variable indexing the partition element, (here Uj denotes an omitted variable), i.e. for 
J- t = g we have the transition 

(2.21) g(m, - ■ ■ ,Ui, - ■ ■ ,uj, ■■■,u m ) — > g(ui,- ■ ■ ,u 3 , ■ ■ ■ ,u m ) 

= g{U\, ■ ■ ■ ,1H, ■ ■ ■ ,Uj-!,Ui,Uj + i, ■ ■ ■ ,U m ), 

so that the function changes from an element o/Loo(I m ) to one o/Loo(I m_1 ). □ 
Remark 6 We could also work with 

(2.22) Ji(u 5f (i), • • • ,u?f f (jv t )) instead of Ttiui, - ■■ ,u^ t \)). 

This means in \2.2V) we view the r.h.s. as element of L 00 (I m ). Then the state space E' is obtained 
by replacing in h2.1S\) the restriction by m = \C\ and using the duality function H(-, ■) given by 

(2.23) H(X, (q, T)) = J J(it S (i),---,u ff(A r t )))a; 4 (i)((i'Ui)---x 4 (| 7r |)(dit| 7r |). 

The form in \2. 22)) codes some historical information which is lost in the form in \2.11$ . On the 
other hand expressions in the duality relation become often simpler in the reduced description. 

We continue with the selection mechanism and its dual. There are several alternate ways 
to incorporate the effects of selection into the dual. In particular some versions involve signed 
function-valued processes and others use only non-negative- valued functions. Some versions involve 
a Feynman-Kac factor in the representation and others do not. We now describe those versions of 
the dual process that will be elaborated on as required in later sections. 

Definition 2.3 (Conditioned evolution of T andJ 7+ ,Q + : the selection mechanisms) 
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(i) If a birth occurs in the process (rj t ) due to the partition element 7Tj, i £ {1, . . . , |7r t |} , then 
for Tt- = g the following transition occurs from an element in L oc ((I) m ) to elements in 

ioo((I)™ +1 ).- 

(2.24) g(u%, . . . , u m ) — > x(«t)fl(«i, • • • , "m) - x(«m+l)ff(«i, • • • , «m)i 

where the new variable is associated with the partition element of the newly born individual. 
(i 1 ) For the transition \2. 2J$ is replaced by (provided that \ satisfies < x < Ij: 

(2.25) g(ui, ■■■ , Um) > 9(ui, Um+l) = + C 1 ~ xC"ro+l)))s(wi, - ' " > u m), 

where the new variable is associated with the newly born individual. In particular 

(2.26) Halloo < 2H.9IU, and g>0, if g > 0. 
(i" ) For (<5 t + )f>o we use the following transition: 

(2 27) 9 ( Ul ' '"' — > (x{ui)l(u m +i)g(ui, u m ) 

+ (l-X(Ui))5(ui,---,Ui_i,U TO+ i,U i+ i,---,U m ), 

in which case 

(2.28) HfflU < IMU. □ 

Remark 7 TTie mechanism (i) will lead to a signed-function-valued process and requires a 
Feynman-Kac factor, while the mechanism (i' ) leads to a non-negative function- valued process 
and in this case NO Feynman-Kac factor is needed. The mechanism (i" ) induces the same change 
in the duality function H as (i 1 ) and therefore leads also to a duality function. 

Remark 8 We note that the dynamic (i" ) disconnects the strict parallel between the variables 
in the function and the individuals in the process r\ which corresponds to the fact that an individual 
in the sample may be replaced by one from outside the sample or vice versa. 

We now introduce some basic objects used to incorporate the mutation mechanism into the 

dual. Let Q™ ut denote the semigroup on L 00 (I n ) induced by independent copies of the mutation 

new 

process acting on L X (I), i.e. the process we obtain from (ll.24[) by letting the geographic space 
(i.e. the index set for the components consist of one point and putting all other coefficients d, s, r, c 
in (|1.24[) equal to 0). We can represent Q™ ut in terms of the process {M t *} f >o which is a Markov 
pure jump process with jumps L oc (I n ) — > L OQ (I n ): 

(2.29) / — > Mjf at rate m for each j = 1, . , . , n 
with Mj given by ([232"j) below. Then 

(2.30) gr t (9)=E[M t *(g)}. 

is a (deterministic) linear semigroup on ngH ioo(I") (with exactly that domain) with generator 

(2.31) M* =m^2(M j -I), 

j 
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where Mj acting on L aD (I n ) is for j £ {1, • • • ,n} given by 

(2.32) Mjg(ui ,...,Uj,..., u n ) = J g(m , . . . , v, Uj+i, u n )M(uj,dv). 

i 

This allows to define: 



Definition 2.4 (Conditioned evolution of J- , T + , Gt~ '■ the mutation mechanism) 

If no transition occurs in (i) f )oo then Tt, respectively , Gt~ follows the deterministic evolution 
given by the semigroup Q™ ut acting on L 00 (I m ) provided that currently J- \ , respectively (J-^ or ) 
is a function of m variables, meaning at the last transition in (t] s ) up to time t became an element 
ofL^dl)™) □. 

This completes the evolution mechanism driving the process (J ( ) f >o respectively (J r t + ) t >o or 
(G^~)t>o for given n. In this fashion, the function- valued processes starting with an element / £ 
ioo(I ra ) for some n are uniquely defined for every given path for the process r\. 

Step 4 (Definition bivariate dual process) 

Combining Step 1 and Step 2 we have defined uniquely for every (ordered) set of n individuals 
with n locations in and a function f in L°°(I n ) a bivariate Markov process (n t , J-t)t>o, respec- 
tively (r/ t , J r t + )t>o or (%, Qf ) t >o in which the jumps occur only at the times of jumps in the pure 
Markov jump process (r)t)t>o an d changes in the function- valued part occur dctcrministically in a 
continuous way (action of mutation semigroup). 

Remark 9 One advantage of working with the process (r)t,J~t) ( or (VttJt)) over working with 
the original process is, that it can be explicitly constructed and only a finite number of random 
transitions occur in a finite time interval whereas changes occur instantaneously in countably many 
components of the process X in any finite time interval and are furthermore diffusive changes. 



2.2 Duality relation for interacting systems 



The following is the basic Feynman-Kac duality relation that was established in jDG99j for the 
general class of interacting Fleming- Viot processes with selection and mutation from which we 
derive the new duality relation in Theorem [2] below without Feynman-Kac term which we use in 
this paper. 

Proposition 2.5 (Duality relation - signed with Feynman-Kac dual) 

Let (X t )t>o be a solution of the (L, Xq) -martingale problem with L as in \1.21$ , Xq = (x^)^ e o N . 
Choose Co S (Oftr)" and f £ ^((I)") for some n £ N. (Recall (2.8]) , (2. 16\) for notation). 
Assume that t is such that: 

it, 



(2.33) E |^cxp ys J \ir r \drjj < oo. 

Then for < t < to, (rj t ,J- t ) is the Feynman-Kac dual of (X t ), that is: 



(2.34) F t (( m J),X ) = E {ri0t r y 



exp(s / \n r \dr) 
o 



i 
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where the initial state (r?o,-?"o) *s given by 

(2.35) % = [{l,..„n},$) r .,((n);({l},{2} 1 ..,{t l })] ) 

6 Oiv /or i = 1, • • • , n; n e N, 

•Fo = /• □ 

Remark 10 In JDG99f it was shown that there exists to > /or which H2. 33p is satisfied. 

The unpleasant features of the above dual are the exponential term and the signed function, 
which make it difficult to analyse for t —> oo. However if we have a fitness function \ satisfying 
< x fs 1 (which because of scaling properties of the dynamic really only means that we do not 
allow unbounded fitness but cover all other cases) then we can obtain the following duality relation 
that docs not involve a Feynman-Kac factor and preserves the positivity of functions: 

Theorem 2 (Duality relation - non-negative) 

With the notation and assumptions as in Proposition \2.5\ ( except \2.S3\) we get for x with 

(2.36) < x < 1 
that for all t 6 [0, oo) we have: 

(2.37) F t (( Vo J)),Xo) = E {vqiK) 



J g+( Ul , - ■ ■ ,u lntl )x (t{1) {d Ul ) ■ ■ ■ x itiM) {du l7rtl ) . 

I 

Moreover, is always non-negative if = />0, similarly for if Gq — f > 0. □ 

Remark 11 The duality relations we develop here for Qn work in fact for every geographic 
space which forms a countable group and for every migration mechanism induced by a random walk 
on that group. With some care we can even pass to migration given by a general Markov chain. 

Remark 12 The duality relations work for general type space for example a continuum [0, 1] as 
well, the special structure of I has not been used. 

We note that the duality relation given above is of the general form considered in duality 
theory but the state space is more complex than in the classical cases for Fisher- Wright diffusions 
or measure-valued processes as demonstrated by the following remark. 

Remark 13 In the language of classical duality theory as described in Remark \5\ we have ab- 
breviating X' t = (ritjFt), the duality relation 

(2.38) E Xo [H{X t ,X^]=E x ,[H(X ,Xl)}, Vt>0 

whenever (Xq,Xq) G E x E' . Therefore the l.h.s. which is the object of interest can be calculated 
in terms of the r.h.s. involving a process of a simpler nature. 

For Feller processes on compact state spaces with generators Gx respectively Gx' , respectively, 
then the duality relation follows from the generator relation 

(2.39) (G X H(;X^))(X ) = (G Y H(X , -))(X' Q ), for all (X 0) X ) in E x E' 

(see e.g. fLig85y , Chapt. 2). On non-compact spaces some integrability properties have to be 
verified in addition (see \EK2\j . Chapt. 4)- 



J" t + (-Ui, • ■ • , u M )x it{1) (dui) ■ ■ ■ X£ t ( M) (du\„ t \) 



E 



(vo,G 
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The duality relation above can be extended as follows. 

Remark 14 It is often necessary to consider moments of the time-space process and then the 
process is observed say at a time t and t + s. In this case one considers the dual particle system 
where the particles corresponding to the moment at the earlier time t are activated in the dual 
process only at times s and the dual is evaluated after evolving till time t + s. This observation has 
first been used in the voter model and is quite useful for us later on in some calculations. 

The duality relation in Theorem [5] can be related to the previous FK-duality in [DG99 quoted 
in Proposition 12.51 if we observe that the action of the selection part of the generator of X can be 
written in two ways, namely as (assuming that x is bounded by 1 and in the expression X9 below 
taking x to be a function of one of the variables of g): 

(2.40) {( X g -X®9)~ <J} + 9 = (X9 + (1 - x) <8 g) - 9- 

The first expression, that is, {(x9 ~ X ® 9) ~ ff} + 9 corresponds to a jump defining the transition 
of Tt (the term in {}) plus a Feynman-Kac term g as we have in the duality in (|2.34[) . The 
second expression, that is, (x9 + (1 — x) ® 9) ~ 9 corresponds to a jump describing which now 
yields as new function-valued state or function consisting of two summands but does not require a 
Feynman-Kac factor. Since integrability issues are involved we now give a detailed proof below. 
Proof of Theorem [2] 

We have to do two things, (1) verify the generator relation and (2) an integrability condition 
to guarantee that the r.h.s. in (|2.37[) is always finite. 

(1) Generator relation. Denote the generator of the process by L and of the dual process 
(77, J r t + )t>o by L' . Then we have to verify for the duality function H that 

(2.41) (LH (-, (/, V )))(X) = (L'H(X, •))(*?, /)• 

In order to verify the generator relation (|2.39j) . we have to calculate LF, recall (jl.24j) and for that 
we evaluate first the first and second order differential operators acting on F. 

Fix neN and a map £ : {!,••• ,,n} — > f2jv- We need to calculate for functions G with 



(2.42) F(X) =j /(«!,- ■■,u n )x m {du l )---,x t{n) {du n ), Ie(P(I)) ! 



X = ®? N Xi 



the first and second order derivatives (recall (|1.20j) - (|1.21|l ). Note that the {£(j) : j = 1, . . . , n} are 
not necessarily distinct and that repetitions can occur. 
For £ g n N (cf. (OCT ) 

/ „ \ 

dF(X) I r 

(2.43) — [v]= ^2 / f(uu--',ue- 1 ,v,u l+ i,-'-,Un)(£)xt(j) 

R <€{l,...,n} \J 



. d 2 F(X) r 
(2.44) —±-L[ V}V >] 



= ^2 \ f{ui,---,ut-i,v,ut+i,---,ut>-i,v',ut> + i,---,u n ) (^) X£(i)(dut) 
Note that if \{£ : £(£) = £}| = k, then there arc f 9 ) summands. 
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We can now apply this formula to our mixed moment F, recall (|2.1[) . Recall (|1.24[) 



(2.45) (LF)(X) = (i mlg + L so1 + L nmt + L sam )F(X) 



E 



+ s 



m 



(v)M(u,dv) - ^^(«) } x ( (du) 



Ox, 



dxe 



1 K I 

d 2 F{x) 
dx^dx^ 



(u,v)Q x Adu,dv) 



x e (P(l)Y 



where Q x (du,dv) is given by (|1.25[) . 

We now apply this to the function F(-) = H(-, (77, J 7 )) (cf. ([2TT3)) ). 
We first consider the action of L mut using (|2.42[) . 

(2.46) L™*H(X,( v ,Fi) 

- •»/( E 

( [Jf( u ii ' ' • > u > u ^+i' " ' ) u n )M(u, dv) - f(ui, ■■■ , ue-i, u, ut+i, ■■■ , u n )] ) 

n 



= 1 



H(X, (r),M*f)). 
We next consider the action of L mlg using (|2.42l) 
(2.47) L^H{X,{ Vl T)) 

= c 



/ ( E ( / f( u ir--,u n ) 

n n n 



= (X, £ MV, /)) = L'^H (X, ( V , T)) 

1=1 rf 

where rf t = = CO'), j ^ - = a N (Z(£),?). 

Consider the following function on the state space of the dual process. Fix an element X S 
(V{T)) Un and define for neN, partition tt of {1, • • ■ , n}, map £ : {1, • • ■ , \n\} -4 rjjv and / : iW -» 
K+, define 
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(2.48) G((?7,/))= / /(in, •••,« w )a;£(i)(d«i)---a;£( W )(du| ir |) 



/(«e(i) 5 ---)«£(H))(R)a:«(d««) = (*?>/))• 



We have to calculate the action of the generator L', i.e. we have to determine (L'G)((rj, /)), where 
L' is the generator of the pure Markov jump process with a piecewise deterministic part (for the 
mutation) which we defined as dual process. We obtain by explicit calculation 

(2.49) L'^H(X, ( V , /)) = (L'G)((/, rj)) = 

/n 

-/(««(x)'"* - '"«H)))] ( S> x d du 

= L™*H(X, (77,/)). 

Combining both (|2.47|) and ()2.49[) we obtain the claim (|2.41l) . The generator calculations 
corresponding to selection and resampling (coalescence) follow in the same way (compare [DG99 ) 
and this completes the required generator calculation. 

(2) Finite expectation 

Here we have to show that 

(2.50) E iv0t r )[H(X, ( Vt , T t ))] < oo for all t > 0, 

in order to use Theorem 4.11 in |EK2j to conclude the duality relation from the generator relation 
we established in the previous point. We begin by verifying this for some to > which is sufficiently 
small. 

Namely we realize first that all transitions occuring in the dual preserve the || • | loo-norm of Tt 
except the selection transition. Here we have g — > X9 + ~ x) ® 9 which satisfies 

(2.51) ||xff+(l-x)®fll| o<2||ff|| 00 . 

Since the number of selection events is given by a pure birth process with birth rate s we have 

(2.52) Halloo < 2** ■ \\TqWvo. 

Hence by explicit calculations of the Laplace-transform of N t we have: 

(2.53) E[\\F t \U < (1 - p )(-JL_yv°||J- || < oo, if p = (1 - e- st ) < \. 
Therefore 

(2.54) MILFtlloo] < oo , V t < ^ for all iV e N. 

s 

We now have to extend (|2.38[) to alH > 0. We argue as follows. 

First using Theorem 4.11 in [EK2] with T T = 2 Nt and T < so that E[2 Nt ] < oo, we get 

(2.55) E Xo [H(X t ,(r,o,T ))} = E Vo . ro [H(X , ( Vt ,T t ))l if t < T, 
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and therefore the martingale problem for X is wcllposcd and has as solution a Markov process (in 
fact a Feller process). Furthermore we know that we can write (this is implied by the form of the 
selection transition which at each birth creates as new state a sum of two functions derived from 
the old function), 



(2.56) T t =F(t,( m ,F ))=J2^ 



4=1 



Then we know from the Markov property of the dual process and its form that 



N t 



(2.57) E 2t =F(t,(r) U F t )) fa, F^)). 



Then observe that the Markov property of (X t )t>o allows to calculate as follows. Using the duality 
for time t to time 2t (here E denotes expectation over the dual dynamic) 



(2.58) 



E[H(X 2tl (r, ,E ))\X }=E[E[H(X 2tl ( VQ ,Eo))\X t }\X } 

= E[E[H(X t ,(r) t ,F t ))\(r)o,F )}X ] 



E[E[H{X U ( Vt , £ Tt,i))Kvo,Fo)}\Xo 



Now we calculate the r.h.s. of the above equation as: 



(2.59) 



E[E[H(X t , £ ^m)]I^o](??o,^o)) = E 



N t 



E[^(^o,fe,F(f,J- M )))|(r?o,-Fo)] 
i 

Nt 

H(Xo,(mt,J2F(t, (vuKMKvo^o 
»=i 



= E[E[H(X ,(m,J r 2t))\(Vt,J r t)]\(vo,J r o)] 
= E[H(X 0l ( mtl E 2t ))\(r, 0l Eo)i 

where we used duality between times t and 2i, the construction of the dual and its Markov property. 
Hence wc get the duality relation up to time IT. Iteration gives the claim for all positive times. 



2.3 Two alternative duals for the mutation component 

In order to handle certain applications where mutation is a key it is sometimes useful to modify 
the dual representation. Here we give two modifications of the mutation induced part of the dual 
as well as their combined version which will be crucial later on also for the refined version of the 
dual which is the main tool for the renormalization analysis. 



2.3.1 Modified dual for state-independent mutation component 

In the case in which there is a non-zero state independent component of the mutation mechanism, 
that is, m > and mM > ml ® p, p strictly positive on I, we can obtain a modified dual which 
is particularly useful when mM — m(\®p) = 0. 

Definition 2.6 (Modified mutation dual) 
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We define the modified dual process (rjt,J-t)t>o resp. (rjt,J 7 t)t>o( ! nt,Gt)t>o f° r a mutation 
matrix M satisfying mM > ml ® p as follows. First, we enlarge the space fijv to 

(2.60) f^Ul*}. 

In {*} all the transition rates of the dual particle process n are 0. In addition we set for the 
function-valued part M ({*}, {*}) = 1 and for consistency extend p by p{{*}) = 0. 

The dynamics of the process (rjt, Ft) resp. (%, Fj)~)t>o, (rjt, Qt)t>o * s now obtained by adding to 
the mechanism of n for partition elements still located on Qn (rather than *) 

(2.61) jumps of the partition elements to {*} after exponential waiting times at rate m 

independently of each other (and independent of those of all other transitions) and by changing the 
dynamics of Ft , Fj)~ , Qt by 

(2.62) replacing mMf generated by the mutation kernel M by the semigroup 

M 4 * corresponding to the transition rates (mM — ml <X) p). □ 

Remark 15 Note that fj t = (Ct,^t) and both components have a different law than (£t,7r t ), 
since in particular on * there is no coalescence taking place. This observation has important 
consequences. In the case M — m(l® p) = once a partition element reaches {*} this element does 
not undergo any further change and the dual process can be easily analysed since it is eventually 
trapped with all locations on {*}. 

With this modification the same duality relation between (X t ) t >o and (rjt, Jt)oo an d its variants 
holds if we enrich by an additional component associated with the site *, in exactly the form given 
in Proposition 12.51 respectively Theorem [2] Precisely: 

Proposition 2.7 (Modified Duality for state-independent mutation jumps) 

Let (X t )t> be as in Provosition \2.5\ satisfying fh > and extend it to a process on Qn U {*}. 
Here the state of the original process X(t) is defined in the additional state * for all times as the 
probability measure p on the type space which is the measure giving the state-independent part of 
the mutation rates. 

Let now (rjt, Ft), respectively (rj t ,J-^~), (fjt,G^~)t>o denote the modified dual processes defined in 
Wm) - V£m) . Then the analogues of WW ,1MB and (OTP hold. □ 

Proof First we note that we can decompose the mutation semigroup in the independent super- 
position of the state- independent and the stat-dependent part. 

Next note that the expectation of the newly introduced jump occurring in our test function is 
exactly given by the action of the state-independent part of the mutation semigroup. In particular 
we have not changed the expected value of the test function switching to the new dynamic. Since 
the duality relation involves on both sides the expectations we get the claim. Alternatively apply 
(|2.39[) and note that the state- independent part of the generator and the part for the jump to {*} 
in the dual satisfy this relation. 

2.3.2 A random representation of the (state-dependent) mutation term of the dual 

The modification we now describe is useful in dealing with mutation which is not state-independent, 
for example in our context when we have also rare mutations from one level to the next. In that 
case it is useful to change the dynamics of Ft ; , by replacing the deterministic function- 
valued evolution driven by the mutation semigroup as specified in Definition 12.41 or in Definition 
12.61 by a random and function-valued jump process. The process rj remains untouched. 
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Namely remove the deterministic evolution from (|2.30[) and replace it by the following Markov 
pure jump process in function space. Introduce the following jumps in function space for g G 
ioo(I fc ): 

(2.63) g(ui,u 2 , ■ ■ ■ ,u k ) — > (M i g)(u 1 ,u 2 , ■ ■ ■ ,u k ), k £ N, 

where Mj denotes the application of the operator M to the ith variable of the function. These 
jumps occur for a function / G L oc (I fe ) at rate m ■ k. This defines (uniquely) a Markov pure jump 
process with values in ((J n L 00 (I n ) starting from any initial state. 

Consider now the situation that T tl , £ t + depends on k variables. For each i E {1, • • • , k} the 
jumps in (|2.63[) occur at rate m and this jump time is independent of everything else. Therefore as 
long as the state of Tt, respectively is a function of k variables these random transition from 
(|2.63[) occur after exponential waiting times. We denote by 

(2.64) T t ,T+,§+ 

the resulting modified function-valued processes. Note that in particular we can assign then the 
mutation jump occuring always with a particular partition element, which will be of some relevance 
for the historical interpretation. 

Since the expectation of the jumps occuring in our test function is by construction given by 
the action of the mutation semigroup and since the duality relation only claims the identity of two 
expectations, we have not changed the r.h.s. of the duality relation and conclude the following. 

Proposition 2.8 All the previous duality relations remain valid if we replace (J-t)t>o by (J-t)t>o 
or {J-^)t>o by {F^)t>o, similarly (£7 t + )t>o by (G^)t>o and leave r\ untouched. The same holds for 
the process (ff t , Jt)t>o, (VtiJ 7 ^), (VtiGf)t>o from Definition\2J\ □ 

Remark 16 Note how here the interplay between selection and mutation is reflected in this form 
of the duality. For example assume that we have rare mutation events, i.e. with rates << s, the 
selection rate. Then: the rate of mutation events in the dual process is proportional to the number 
of partitions elements we have in the dual process and therefore rare mutations become visible in 
the dual process as soon as many births have occurred in the dual process due to a much higher 
selection rate, which then compensates via the large number of individuals a small mutation rate. 

2.3.3 Pure jump process dual 

We can use both constructions presented in the previous two subsubsections at once: 

Definition 2.9 (Pure jump process dual) 

In particular in combining the new representations of the state- dependent part mM — ml (Si p 
and the state-independent part in ml ® p we get a pure Markov jump process 

(2.65) (%,Ji) t >o, (Vt,Ft)t>Oi (vt,G?)t>o- □ 

It is the dual process on which we base our refined dual in Subsection 12.51 and it is also the 
version best suited for the historical interpretation since it generates a marked (locations, mutation 
events) random graph in which we can find the marked ancestral tree of a tagged subpopulation. 
(Compare Subsubsection 12.5^4")) . 
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2.4 Historical interpretation of the dual process 

Does this analytical construction above have a heuristic meaning or is it simply an analytical trick 
revealing an explicitly solvable model? Both these alternative situations occur in the theory of 
stochastic processes. In our model, however, the duality can be interpreted in a nice way if one 
considers what is sometimes called the historical process. In fact this duality can be extended to 
reach processes including more information about the individuals past (see fDGPj ). 

In order to understand this, one has to remember that the diffusions we work with arise as 
small mass - many individual limits of particle models. In such a particle model it is possible to 
follow through time, the fate of individuals and their descendants and define for each individual 
currently alive its ancestral path leading back to its father then its grandfather and so on where 
this ancestral path gives the location and the type of the ancestor at every time. In fact we can 
from a time horizon t backward generate the collections of ancestral path and their genealogical 
relation which define a tree or rather a forest of trees corresponding to different founding fathers. 
This way we obtain a random marked (with types and locations) forest. A more modest question 
would be to determine the types and the time back to the first common ancestor of the sample 
taken from time t (this means we only get the types at time t and not at all earlier times along 
the ancestral path.) 

How can we study this complicated object? For each given time horizon t we can zoom in 
on a finite subpopulation of tagged individuals, tagged in the time t population and trace their 
history backwards in time. Then we can ask whether this object can be generated by a suitable 
stochastic process, the backward process. Can we hope for this process to again be Markov and 
time-homogeneous? 

To understand this better, take the non-spatial case first. Ask the question: What are the 
types of a fc-sample of individuals from the time t population. To answer this question generate 
the law of the historical evolution of this subpopulation by running a stochastic process backward. 
Similarly in a spatial model one can take samples at different locations. A nice case arises if this 
backward dynamic of the sample alone is already a Markov process, which is time-homogeneous. 
It is in this situation that one traditionally speaks of the existence of a dual process. 

Resort first to a simpler, the neutral case. Due to exchangeability of individuals for the neutral 
case (i.e. no selection and mutation) such a Markovian backward process can be given based on a 
coalescent generating the family structure of the sample. A key tool to establish this in our context 
is the representation in terms of the lookdown process of Donnelly and Kurtz adapted to this spatial 
situation, (compare |GLW1 ) . This then allows to rigorously establish and identify the dynamics of 
the backward process in terms of a spatial coalescent ( |GLWj ) . This can be extended and one can 
show that the genealogical trees of the neutral model evolve in such a way that their state at a 
fixed time t is given by the genealogical tree associated with the coalescent, see [GPWmctric and 
|GPWmp| . 

In the case in which selection and mutation both depend on the type of the individual exchange- 
ability is no longer preserved and a complicated interaction between the tagged subpopulation in 
the sample and the remaining population arises, which is not anymore in law equivalent to an 
autonomous evolution of the sample. Hence in order to generate the genealogy by a backward 
process we have to add a richer device which consists of a reservoir of possible histories in order 
to still obtain a Markov process driving the backward picture. 

In the literature this problem has been treated first in models where the process and its dual 
can be specified via a random graph (graphical representation). In the case of the existence of 
graphical representations, such as the voter model or stochastic Lotka-Volterra models which are 
of that type and which appear in the models and work of Krone and Neuhauser ( |KN97| ) there 
are typically arrows between points in the random graph which might or might not be used which 
describe the possible action of selective forces. 
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In our context selection requires, as explained above, adding new individuals to our dual (sam- 
pled) population as we move backward from the time horizon which represent the potential insertion 
of a fitter type in the tagged population. Whether such a potential insertion actually takes place 
depends on the fitness of the " victim" in the tagged population and of the fitness of the potential 
intruder. This means that in order to decide the types of the fc-sample of the population we have 
to consider for selection a growing dual population representing typical individuals drawn from 
the population at a certain time s back. Then we have to assign weights to these possibilities 
representing the probabilities with which they are realised. Therefore the ancestral lines of our 
sample form a subtree of the graph generated by the ordered particle process rj. Among the possi- 
ble choices for subtrees in this graph the various possibilities have probabilities which we can read 
of from the function- valued part. 

In addition to the complication arising from selection, along the ancestral path mutation events 
have to be taken into account. For the state- independent part such an event decouples the final 
type from everything happening earlier, but if the mutation is state-dependent, we have again to 
consider the complete system of backward path but since the tagged population is small compared 
to the basic population a law of large number effect occurs and this can be represented through 
a functional dual reflecting the fact that mutations occur based on the current type and do not 
depend on the overall populations. However with the mechanism for the dual as described in 
Subsubsection [2\3.2I we can even associate with every individual a chain of mutation events. But 
we have not yet a rich enough system to associated with the ancestral path of the individual, a 
path in type space. We will see in Subsection 12.51 how this possibility arises. 

In the spatial context we have to sample k\ particles from a site £(1), k 2 from site £(2), • • ■ , k m 
from site £(ro) and the ancestral paths migrate in space. Altogether we therefore get a spatial 
coalescent with birth and with mutation operations associated with each ancestral path which 
represents the historical evolution of a randomly drawn fc-tuplc from the time t-population of the 
original process. Then we can use the dual to calculate the probabilities that /c-sampled individuals 
have specific types, a specific genealogy and paths in geographic space. We shall discuss more of 
this as we go along, see also the explanation of the refined dual from a historical process perspective 
in Subsubsection |2~5.41 

2.5 Refinements of the dual for finite type space: (rjt, J c t' + )t>o-, (Vt,Gt~ + ) 

In this section we focus on the case of a finite type space and we consider refinements of the dual 
(?]t,J-^), (rcsp. {rj u J-t) or (rjt,J- t ;)) and similarly with the versions using Q + ,G + , (rjt, which we 
denote Q+ + ) t>0 which mainly require an enrichment of the mechanism corresponding to selection 
and mutation in the function-valued part of the dual and which works (only) for a smaller set of 
functions in which the function- valued part can start, namely the function / in (|2.1[) has to be 
a sum of products of indicators. This set of functions however is still generating the full set of 
functions we consider and is therfore in particular distribution-determining for finite type space 
and hence suffices for a duality theory. However to guarantee that this subset of functions is 
preserved under the part of the dual dynamics corresponding to selection and mutation dynamics, 
we have to change the dynamics of (77, J r+ ) or (17, Q + ). 

In particular this duality allows for a nice historical interpretation, since it generates a marked 
graph (marked with types and locations) of which the ancestral marked tree of a finite sample is 
a subtree (see Subsubsection 12 . 5 .4[) . 



This constructions we describe in this subsection are written for the case of a finite type space 
(2.66) !={!,..., K}. 
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Remark 17 An extension to countable type space is possible, but not needed in the sequel. In 
particular if we assume more about the fitness functions, for example that fitness values have 1 
as the only accumulation point, then we can handle also immediately the case of countably many 
types. 

2.5.1 Ingredients 

For this refined duality we use a suitably chosen smaller set of functions as state space for the 
function-valued process. However the key point is to also use a modified dynamic of the function- 
valued process J-" 4 + , Q^f (later on called ,G^ + ) which is generated by a refinement of the birth 
process in rj t , the particle process driving the function- valued process together with a refinement of 
the mutation part of the function- valued part of the dual process. The point of these modifications 
(leading in a way to a more complicated dynamic), will be (1) the smaller subset of test functions 
is preserved and (2) this is suitable for development in [DGselj to consider the time-space process 
of this dual and use the additional information built into the dynamic to define a process on a 
richer state space (space of tableaus representing decompositions of the set I N ) which can then be 
analysed better if we are concerned with the longtime behaviour. 

In points (0) -(iv) we now explain step by step the needed changes in the state space and the 
four mechanisms in the dual to obtain the refined version. 

(0) State space of function-valued part 

The state space for the dual process better its function-valued part is the subset of functions 
on I N arising as follows. For k £ N, define first the space of certain finite sums of products of at 
most fc-indicator functions of a variable in / each: 

n hi 

(2.67) F fe := {f = Y TtlBi 3 -(«j).-Bi ; Q {1,2,---,K}, n£N,k = max ki}. 

I * — i— l,...,n 

i=l j=l 

Here / above is viewed as a function of I k . 

Remark 18 Note that the states in this set of functions need not satisfy J fdfi® k < 1, as would 
be the case if f defines a decomposition of (T) k . Therefore we have two cases, the version of the 
dual based on J- + where this is not the case and the dual based on Q + , where in the definition above 
we can impose the condition 

(2.68) J fd^ k < 1 

and still obtain a set of states preserved under the dynamics. 

The state space of the function- valued component of the refined dual process is given by: 

oo 

(2.69) F := |J F fc . 

k=l 

This function space is associated with the situation in which we have k particles in the dual 
particle process r\. For consistency the product running from ki to k could be filled up with k — ki 
indicators of the whole type space I. The parameter n allows us to consider the evolution of 
functions in which there is a mechanism that replaces a product of indicators by a sum of products 
of indicator functions. 

With each variable we associate a position in space. However this is not changed under selection, 
mutation or resampling. Hence in order to explain the dual mechanism for each of those we can 
ignore the spatial aspect for the moment. 
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We now explain the dual mechanisms corresponding to selection, mutation, resampling and 
migration in the original process step by step. 
(i) Selection. 

In order to describe the change of the transition in rj at a birth event and the new transition 
in the function-valued component of the dual upon a birth event we use the following structure. 
Assume that there are £ fitness levels reaching from to 1 and denote them by 

(2.70) = ei < e 2 < ... < e e = 1. 

(Note without loss of generality to assume for bounded fitness functions in assuming that e\ = 
0, e t = 1.) 

First change the dynamic of the first component, i.e. 77, so that the birth process is replaced by 
a multitype birth process, more precisely a (I — l)-type birth process. Births occur at total rate 
s but now when a birth occurs the type of birth (g {2, . . . ,£}) is chosen i.i.d. for this event with 
probabilities 

(2.71) (ei - ei-i)i= 2 ,-,e- 

We shall now introduce the jumps in the function- valued part occuring upon a birth in r] of 
specific type. For each tagged individual of a certain level of fitness say ej, all those individuals 
with a strictly larger level of fitness can be a candidate to take the place of the tagged individual. 
Define therefore for each level of fitness i, with i = 2, • • • , £ the set 

(2.72) Ai := {j e I : X (j) > ej. 

We note that A i+ i C A4. 

We define for a subset C C {1, 2, • ■ ■ , K} of the type space the operator ipc on F as follows. 
Let / be a function of /c-variablcs and define 

k 

(2.73) * c / := [lcW/(tti, • • • ,u fc ) + f(ui, . . . ,u fc )(l - lcK+i)]- 

m— 1 

Observe that (ipc — Id) is the generator of a rate 1 jump process on F. Namely denote l l c — 
lc(ue) and introduce for / € F^ for every variable Uj, j = 1, • ■ • , k at rate 1 transitions: 

(2.74) /^1 J C / + /®(1-1 J C ), ./ 1. ••••/■•. 

Then each jump from some / in F& ends in F^+i. 

We apply this to our context for C being replaced by the sets defined in f|2.72[) . we use the 
notation 

(2.75) xi = l Ai ( Uj ) 
and set for / G Ffe, 

(2.76) (#*/)(«!, • ' • , = ((Xif) ® li +1 ) + (1 - X\) ® f))(ui, • • • , u k+1 ). 
Again we can define an alternative transition as follows 

(Wf)(ui, ■ ■ ■ ,u k+1 ) = Xi(ui)f(ui, ■ ■ ■,u n )l(u k+1 

+(! - ■ ■ • ,Ui-i,u k +i,u i+ i, ■ ■ -,u k ). 

Remark 19 It is easy to verify that if < / fd^ < 1, then < / tp{' k fd^ ( - k+1 '> < 2||/||oo 
/or any probability measure fj,, respectively < J tp?' fd[i® k+1 < ||/||oo- 



2 FUNCTION-VALUED DUAL 



27 



Remark 20 Note that this definition means that if f is a sum of products that the jump occurs 
simultaneously in the corresponding factor in all summands. 

Remark 21 A different ordering of the factors and variables in different summands and a modified 
dynamics will be used later to couple the dynamics of different summands arising with every birth 
event. 

Definition 2.10 (Selection jump) 

Introduce transitions (jumps inW) for the function-valued part of the form: 

(2.78) f from F k ^F k+1 , 

whenever a birth of type i due to partition element j in rj occurred. (Recall here i is chosen with 

probability (e, — ei-i)). This will replace the transition we had in before. For we use 

% mk . □ 

Now the r.h.s. of (|2.25l) is interpreted as rate s births being of type i with probability (e, — ej_i) 
and leading to a transition given by (|2.78j) . 

The type of birth does not influence the further evolution of rj, it only will change the function- 
valued part occurring at this transition of rj t ; wc therefore do not enlarge the state space of the 
process rj to store the type assigned to the birth event. 

(ii) Mutation. 

We have to specify the action of mutations on functions / € i.e. functions of variables in 
the special case in which they are certain sums of products of indicators. We use here a refinement 
of the random representation of the mutation semigroup as used in Subsection 12 . 3 . 2l since the latter 
does not necessarily preserve indicator functions. We now construct a random indicator- function- 
valued jump process that represents the mutation semigroup (M t *)t>o- 

Remark 22 (Set-valued dual for Markov chains) 

The construction we give below produces in fact for every Markov jump process on a finite state 
space a set-valued dual process. Let E be its state space, then we can calculate p[Z t = i\Zo = j] = 
E[i € At\Ao = j] for a set-valued process, i.e. values in 2 E called (At)t>o with jumps and rates in 
Definition \2.11\ In particular we can calculate also its equilibrium distribution. 

This process acts independently on each variable of the function J-^ + (corresponding to par- 
tition elements of the dual process). That is, at random times the function J r t ++ or for our 
new function- valued process in the set of sums of products of indicators is changed by a jump from 
a product of indicators to a new product of indicators, where 

(2.79) all factors for a given variable change in every summand at once. 

We next describe the action of this modified mutation dual acting on one variable in the 
argument of / (to keep the notation simple we think of / as a function of one variable with the 
others fixed). 

The mutation semigroup driving the function-valued part of the dual process when acting on 
indicator functions can also be represented by an indicator-function-valued dual (random) process 
whose jumps are specified next. Later we extend this to the sums of indicators. 

We specify for each pair of types the jumps /(•) — > /(•) corresponding to the mutation 
transition i — > j, by the following prescription. Recall M — (mij)ij,...,K ■ 
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Definition 2.11 (Set-valued mutation jumps) 

This transition from type i to type j occurs at rate 

(2.80) m-rriij ; i,j € {1, 2, • • • , K) 

and results in a jump (depending on whether £ S {i,j} or not): 

(2.81) — > l{i}u{j} 

for i,j e {1,2,- ■■ ,K} and j ^ i. □ 

Next extend this to F. For this purpose let B C {1, • ■ •, A'} and consider the indicator 1b(u). 
Since this indicator can be written as 



(2.82) l B (u) = J2kk}(u), 



we continue the transition in (|2.81[) as a linear map acting on the indicators / = l/n(-), with 
t€{l,...,K}. 

More generally proceed as follows. The transition / — > / associated with the parameter 
is obtained by applying to / the matrix M (in {k,£)) which we define as 

This specifics the transition occurring in one of the several variables of /. 

Definition 2.12 (Refined mutation jump) 

We now obtain a function-valued dual process J 7 ™ 11 ' for the mutation semigroup acting on 
indicators or sums of indicators if we introduce the following collection of jumps for / £ for 
each I e {1,2, •••,&}: 

(2.84) f(ui,---,U£,...,u k ) — >^2Me(i,j)[ui,v]f(ui,---,v,---,Uk); i,j £ {1,- ••,#}, 

V 

at rate 

(2.85) mrriij, 

where indicates that M is applied to the Ith variable. □ 

For every variable £ these jumps preserve the sets for every k € N since the number of 
variables remains fixed. Hence the set F is preserved under a dynamic consisting of jumps like in 
d2H]). Therefore if T^ ut = f e F fe for some k, then J 7 ™ 11 ' is a F- valued process. 

(Hi) Resampling- Coalescence. 

Here as before we identify two variables located at the same site in the corresponding factor 
of all summands of the element in F^ we are dealing with. When coalescence occurs the resulting 
(coalesced) clement is given the lower of the indices of the coalescing elements and the remaining 
elements are reordered to eliminate gaps but preserve the original order. Note that this operation 
turns two factors 1^(1^), lgfuj), i < j each related to one of the coalescing variables into the 
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indicator of the intersection of the two sets as a function of the "new" merged variable. That is 
upon coalescence: 

(2.86) lA(ui)l B (uj) — >l A nB{ui), lc(ue) — ► lc(ui),l <£ 
Therefore again F is preserved since this operation sends 

(2.87) /gF fc — >7eF fc _ x . 
(iv) Migration 

The action of the migration is as before and has only to do with the process r\ t assigning 
locations to the variables but not with the jumps in the function- valued part. 

Remark 23 We can now proceed as in the definition of (rjt,J 7 t') o,nd add a state {*} to the 
geographic space where all rates are set equal to zero. Note that under this dynamic F is preserved. 

2.5.2 The refined dual or {n t ,g+ + ) t > 

We construct the full refined dual dynamics of the process which is denoted (774, J-" t ++ ) respectively 
(Vt,G? + )t) by setting: 

Definition 2.13 (Refined dual (rjt, J~^ + ), Vt, §t + ) 

(a) The process (rjt, J r ^ + )t>o *s defined as (rjt, J 7 i + )t>o except that births now have a random 
type as specified in 12. 88}) which changes the transition of J r t ++ at a birth event as specified below 
and the mutation transition is replaced by a jump process for the indicators in the representation 
o/J r t ++ . Similarly we proceed for {j] tl J r t ++ ) t > . Precisely the two changes are: 

(i) The transition occurring in the second component at a birth event is modified as follows: 

if .F t ++ G Ffe and a birth of a type i occurs at time t from the partition element with index j we 
have the transition for the factor f of the i-th variable (recall {2. 76}) ) : 

(2.88) / -> # fe / = xlf ® 1 + / ® (1 - x{), ( resp. for G++ : / — > X \f ® 1 + (1 - x\) ® /), 

where the factor (1 — x\) an d ^s (new) variable corresponds to the newborn individual. 

(ii) The mutation transition in the function-valued part is replaced by jumps according to [2.84}) - 

(EM> - 

(b) Similarly we define (rjt, J 7 t ++ )t>o or (rjt,G^ + ) if we have state-independent mutation at rate 
m as the corresponding modification of (j)ti J~t) respectively (fjt> Qt)- ^ 

The key feature of the refined dual (rj t , Q ++ ) is the fact that the terms 1a and (1 — 1a) which 
are created by births, evolve by further selection and mutation to terms being identically or 1, a 
state which we call resolution. At resolution time one of the two summands generated at the birth 
event disappears. 

2.5.3 The refined duality relation 

Now we can state and prove a duality relation between the process (rjt, J 7 t ++ )t>o and the interacting 
Fleming- Viot diffusion with mutation-selection. 

Theorem 3 (Refined duality) 
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Under the assumption of finite type space we have the following duality relation between the 
process (X t )t>o an d the two refined dual processes: if |£| = k and f £ then 

FMJ),X ) 



(2.89) - E ^ 

= E U,f) 



1 1 

J---J Qt + i u u ■ ■ ■ , u kt |)x Ct(1) (dui) • • ■ x MM) (du M ) 



and analogously for the state-independent mutation and (rf t , J~^ + ):(v> Qt + )t>Q- D 

Proof of Theorem [3j This follows from the previous duality relations by observing that 
we changed two things (I) we use the dual wc had previously now on a restricted class of test 
functions /, namely those in F which is a set of functions preserved under the dynamics. (II) we 
have reinterpreted this dynamic on F in an autonomous way, see (j2.92[) and (j2.98[) . such that the 
expected jump induced in the duality function remains the same, which we now explain in detail for 
the selection and mutation transition where the changes in the dynamic occured since the duality 
claims that certain expectations are equal it is preserved under the change. 

(1) First consider the selection transition. We can rewrite the fitness function x as follows. 
Namely 



(2.90) x = ^2{e i -e i -i)lA i . 

i=2 

At the same time (since e\ =0, ei = 1): 

i 

(2.91) l-x = X)(e i -e 4 _i)(l-l > i i ). 

i=2 

Now observe that the new state of J-" t + after a birth event increasing the number of basic particles 
from k to k + 1 can be rewritten as follows: 

i k 

(2.92) ( X f ® 1 + (1 - X) ® /) = - ^-O E ^' fc /] 
and 

(2.93) (e,-e i _ 1 ) i=v ./eP({l, •••,*}). 

We therefore can interpret the transition / — > (xf ® 1 + (1 — x) ® f) m the process J-" t + now 
differently as superposition independent transitions defined by which gives then the transition 
in the refined version denoted .F t ++ . 

(2) To prove this alternative representation of the mutation part of the dual process first note 
that we can write the generator of the mutation process, i.e. the jump process on type space for 
a fixed individual in the form of the independent superposition of mutation processes where each 
summand corresponds to the transition from type i to type j. This reads in formulas: 

K K K 

(2.94) Mm =j2mj)(m - /(<)) - E E m (^)m/w, 

j=l k,l=lj = l 

where 

(2.95) {M(i,j),i=l,...,K; j = 1, • • • , K; i ± j} 
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is a collection of kernels M(i,j)[-, ■] given by 

(2.96) M(i,j)[i,i] = -m i , j 

M(i,j)[k,£] = 0, for (k,£) different from or 

Note that each M(i,j) is the generator of a mutation semigroup with only mutations from type i 
to type j at rate m^j. Hence the collection above represents the collection of all possible mutation 
jumps at their rates. 
Note that: 

(2.97) M(i,j)[; •] = m itj (M(i,j)[; ■} - S [iA ), 

which means that this jump in (|2.81l) represents the generator M(i,j )[•,•]. This completes the 
proof of Theorem [3J 

Remark 24 Suppose only the mutation transitions occur. One calculates that the generator of this 
process is Y) t M.t where A4e is M. acting on the l-th component. Hence the duality representation 
of the mutation semigroup M t * acting on indicator functions is given in terms of the indicator- 
function-valued dual as follows: 

(2.98) M t * (f[ IbM) = E[7r*(xi, . . .,x L )\J^ ut = f[ l Bt {x t )]. 

\e=i I i=i 

This will serve as the mutation component in the construction of an indicator-function-valued dual 
for the mutation-selection-migration system. As before, in the case of a sum of products of indicator 
functions the jumps are made simultaneously in the corresponding factors in all summands. 



2.5.4 Historical interpretation revisited and outlook on a modified dual 

We can use the representation of J 7 t ++ given in the previous points to clarify the relation between the 
rchncd duality and the selection graph of Krone and Neuhauser |KN97j and concepts in population 
genetics referring to the calculations of probabilities for certain genealogical relationships. Of 
course there arc differences and specifics of our model, (1) we have multiple occupation of sites, 
(2) resampling only in one colony and in addition (3) we have taken a diffusion limit of many small 
mass particles. All this generates different features. 

However if we consider an n-sample and their ancestry we are back at a discrete model and 
instead of arrows attached to sites specifying potential insertion of fitter types in the graphical 
representation we generate potential insertions attached to individuals in the sample. However 
our backward process is a Markov process despite the presence of selection or mutation. This we 
explain now in detail. 

Given sets C± , . . . , C m the dual allows us to determine the probability that m individuals chosen 
randomly from the population have types in these sets and also the probability that they have a 
given genealogy. Already in the neutral model with mutation we have to use a function-valued 
dual (or for finite type space a set-valued dual) which allows to determine the probability that a 
given genealogy of the sample generated by the coalescent can result in a certain marking with 
types at time t. 

Selection complicates the picture even further. The calculations involving the new factors 
created by selection provide tests comparing individuals with the type of other randomly chosen 
individuals from the population. The corresponding probabilities are obtained by summing over 
potential histories. To be more precise consider the following. 
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We can now associate with our dual particle a marked graph as follows. Starting with the 
tagged sample we draw from every individual an edge, till the first selection or coalescence event 
occurs. If a selection event occurs we place a vertex from which two new edges start, one associated 
with xi, the other with (1 — xl) if the birth occurs with the i-th particle and is of type j. If a 
coalescence occurs we join the two edges and continue with one edge. On each edge we record the 
current location of the corresponding individual. Therefore edges are now marked with functions 
X J or 1 — x J an d with locations. 

At each edge we place at rate m • mjj mutation markers indicating a mutation event on the 
variable of the form i — > j. 

With this construction we have generated a random marked graph in which we consider all paths 
leading from the roots (the tagged individuals) to the leaves at time t. Every such path generates 
a product of indicators by multiplying the function at the root and the ^-functions attached along 
the way. Then act with the mutation markers on the factors attached with an edge. Finally sum 
over all these products corresponding to path connecting the leaves to a root. This is the state of 
J r t ++ . Note that if we put /o = 1, then we see that the expectation of the sum of factors arising 
from the birth events and undergo mutation have expectation 1. 

Can we associate with these objects, i.e. the marked random graph the marked ancestral tree 
for the tagged sample meaning we know the joint law of the genealogical distances between the 
tagged sample together with the type and geographical location at time t? This would require that 
based on the random graph explained above we can decide at each branching point which way to 
go. If we use J- ++ the obstacle is that we have x( u i) an d 1 ~~ x( u m+i) which since they belong to 
different variables do not define a decomposition in the potential set of marked ancestral path into 
disjoint sets. 

For the purpose of achieving this we use Q ++ instead. Suppose we have a birth due to particle 
number i of type j. Then we replace this transition by 

(2.99) f( Ui ) — > X j (ui)f(ui)l(u m+1 ) + (1 - X 3 )(ui)f(u m+1 ). 

The key feature is that selection introduces a decision tree with corresponding probabilities for 
the different genealogies which are possible after the interaction of the sample with the rest of the 
population. The selection transition / — > xf + (1 — x) ® / we defined for J r++ (and for F + of 
course as well) is changed in the new picture corresponding to Q ++ into (suppose here / = Is) 

(2 100) f — > lA i (ui)f(ui)l(u m +i) + lA j (u i )f(u m+1 ) 

= Uj-nflKJl^m+l) + 'i-A j (Ui)lB(y m +x), 

where now the two summands are alternative possibilities the ancestral path can take and their 
respective probabilities associated to the two new particles in each summand where one represents 
the preservation of the sample in the other the insertion of a superior individual in the sample. 

Recall now the coupling rule of the summands (the factors follow the transitions of the individ- 
uals they are associated with). We now see that the marked ancestral graph associated with this 
model has now the property there is at most one path from the initial particle "root" to the "leave" 
at the other end, since now x 3 ' (1 — X 3 ) = an d X 3 + (1 — X 3 )f < 1- The < sign appears since it 
might happen that not all n-path end at the leaves, which means that the tagged sample with the 
given type configuration and the generated genealogy is not consistent and has probability zero. 

In any case we can read each summand as a possible line of ancestry and the factors generating 
the probabilities. In the sequel we shall carry out and apply this construction at great length and 
introduce sums of ordered factors to parallel the genealogical relations. 

Here we give an explanation how the genealogy-type structure is generated. This means that we 
draw n individuals from the population and record their respective type and location and the time 
we have to go back for every pair to find the most recent common ancestor. The joint law of this 
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statistics we want to derive from the dual process. We start with the model (1) only resampling, 
(2) adding types and mutation, (3) adding selection. 

(1) The genealogy of a sample of n-individuals is obtained by considering the time when pairs 
of two individuals of the sample first land in the same partition element. This (™) different random 
times are in distribution equal to the entries in the matrix of genealogical distances with the sample. 

(2) Now the individuals carry a type and for a given genealogical tree we have to calculate 
the probabilities that we find specific types on the individual of the sample. For a realisation of 
the coalescent we can, with the function-valued dual, determine the probabilities of a specific type 
configuration at time t given the one at time 0. Namely with every partition element at every 
time s < t, there is a connected factor a function on type space which changes according to the 
mutation dynamic. Testing with the initial state the factor turn if the type is not possible and 
1 if it is possible. Note that if the set-valued process has reached the full or the empty set, the 
process contains no further information about the time further back in the original model. 

(3) Finally selection enters into the picture. The selection transition in the dual accounts for 
the possibility of an interaction of the sample with the rest of the population. In the neutral case 
this can be suppressed without changing the law since all individuals are exchangeable if they are 
at the same site at the moment resampling occurs. This is not true of course if we have type-based 
selection. Each action of the selection operation introduces two alternative forms of the genealogical 
tree of the sample by the interaction with a new randomly chosen individual from the population 
at the time the selection operator acts. Each realisation of all mutation transitions, coalescence 
events for an 7V-coalescent (in which the sample is embedded) fc-selection transition generate 2 fe - 
realizations of a marked random tree which are the potential genealogy-type configurations for the 
sample at time t. For each of those we can calculate the probability from a decision tree (whose 
leaves are possible genealogical trees) and where the edges carry certain factors which change 
according to mutation and selection. 

A key point is now that all the possibilities are alternative, i.e. only one such tree is the actually 
realized one for the sample. Furthermore the structure is such that a finite random time back it is 
resolved which of the different possibilities actually occurs. 

The following diagram illustrates the decision tree represented in the dual. Here the subscripts 
refer to the order of the factors and the selection operator has acted twice on the first factor and 
then the first and second factors have coalesced. We also used a coupling in which the operations 
are simultaneously applied to the same factor (according to the order) in each summand. The 
result of this history has two summands (1 — x) ® / an d xf ® 1- The indices 1,2,3 refer to the dual 
particle involved. This object can be viewed as a decision tree to decide which ancestral paths are 
possible for the tagged sample represented by / and every leaf representing a possibility. 

The following picture indicates this but note depicted is the decision tree and not the genealog- 
ical tree. In that decision tree we have first two selection events (the second applies to newborn 
particle) and then coalescence of particle 1 and 2: 
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3 Set-valued dual 

We consider now the case where I = {1, • • • , M, M+ 1} where M + 1 is the type of maximal fitness. 
Our starting point will be the function- valued dual processes (rj,T ++ ), (n,Q ++ ). 

3.1 Ordered function- valued dual 

We introduce now an order into our dual process, based on the historical information on the 
successive action of the selection operator. We treat T ++ and subsequently Q ++ . 

3.2 Modification of dual 1: .F t with ordered factors 

In this subsubscction we carry out the modification of the dual constructed in Subsection 

12.51 involving an enrichment (see Remark I25[) . In the case of our (M, l)-model we introduce here 
a simplified version applying for the case of M lower level types and one higher level type M + 1 
with only up-mutation to the higher level. 

Remark 25 That enrichments of dual processes give again duality relations is a general principle. 
For this we recall the general form of a duality ( see \U.1\) and note that if we have for X a dual 
process Y with a duality function H(-, •) and if we can introduce a new process Y* on a new state 
space £"'* such that there exists a map 

(3.1) K : E'<* -> E' such that £[«(F t *)] = C\Y t }\ 

then we have a duality with duality function H* : E x E''* — > R by setting 

(3.2) H*(-,.) = H (.,«(•)). 

Remark 26 Typically such enrichments arise from a duality relation of the new dual to an en- 
riched original process. We no not use this fact here and hence do not prove such a relation 
here. 

We proceed in six steps. 

Step 1: Preparation: further observations on the nature of J r t ++ . 
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As a preliminary step to the construction of the modified dual process that we shall use in the 

•+ 
i 



case M > 2 we now look in more detail at the structure of the function-valued part -F t ++ of the 



dual process. For that purpose we view J 7 t ++ not only as a certain function but we explicitly work 
with its form as a sum of products of indicators, in other words this form will become part of the 
state, see (|3.5p below. 

This information is based on historical information about the process rj, namely the information 
about the complete ancestral relations in the particle system rj. 

First consider the special case M = 2. This can be expressed as follows. Define the abbrevia- 
tions: 

(3.3) h = (110), h = (100), h = (010), U = (000), 

h = (HI), h = (001), h = (OH), fa = (101). 

Let F = (/i, . . . fs) denote the set of possible factors. 

Then starting with J-q + = (110) and applying selection and the lower level mutation operations 
we have 

(3 4) J- ++ = (^) f ® fcl ^® ft2 ^' Ji *) j-^M*,.?'*) j® k i{i,i,t) ._ -p++ 

i j=l i 

where Nt denotes the (random) number of summands and 

(3.5) fci,fc 2 ,fc 3 ,fc 4 : {1, •■■,&} x {1,2, ■■•,iV} x [0,oo) ^ N , 

are appropriate (random) functions which depend on t, for example, ki(i,j,t) is the number of 
fx = (110) factors in the ith summand at time t located at j G {1, . . . , N}. We can encode all 
information by coding J r ( ++ in the form of a different type of object (misusing notation since we 
refrain from using a new letter, since we go even further below introducing J r++ ' < , where we 
explicitly spell out the dynamics: 

(3.6) = (N t , {(h(i,j, <),•••, M», i, *)); i e {i, • • • , N t },j S {1, • • • , N}}). 

We note here that whenever a birth event in 77 occurs and the selection operator acts on J r t ++ , we 
can get new summands. 

Remark 27 Note we could have if 1^/ = that 

(3.7) /— >U/+(1-U)®/=(1-1a)®/ 
and even so a birth occurs we get no new summand. 

We can perform calculations by expressing the dual expectation in terms of a suitably chosen 
population of factors, which gives the dynamic of the (kx(i,j,t), ■ ■ ■ ,k4(i,j,t)). In particular we 
now consider the population of summands and the dynamic of the exponents in each summand. 

Step 2: (The basic idea: passing to tableaus) 

The key trick is to order factors and introduce factors 1. This is based on the following obser- 
vation. We can also keep track of some historical information in order to deal with the sum on 
the r.h.s. of (|3.4p . In particular we introduce an ordering of the factors that allows us to associate 
factors in different summands. First note that a product of indicators can change into a sum 
of products of indicators only by the selection mechanism and all other transitions preserve the 
product structure of the indicators of a summand in the J-^~ + respectively Gt~ + dynamic. 
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Recall next that selection operates by 1a • /+ (1 — 1a) ® / with 1 — 1a corresponding to a new 
variable. Observe first that we can write without changing integrals but having the same number 
of variables for both summands as 



where with M = 2 types we get 1a = (Oil). Then it is useful to place the new factors 1 and 
(1 — 1a) not at the end but next to the factor giving birth. 

Remark 28 Recall that in the Q ++ we have reordered the factors so that in all summands 1a • f 
and (1 — 1a) always are associated with the same variable, we call this having the same rank. Then 
we can couple the operations on different summands in a different way so that both are operated 
on simultaneously by the mutation, selection and migration operators acting at a given rank (and 
not particle). This will be exploited again in the next sub sub section. 

From (|3.8[) we sec that we get a sum of two terms now involving a new variable corresponding 
to the particle added by the birth event in the dual particle system. As time goes on this produces 
a binary tree and each path in this tree starting from the root and ending in a leaf corresponds 
uniquely to a summand. 

In general and more precisely we make the resulting two summands comparable in the sense 
that they have the same number of variables in both summands in (|3.67[) . but without changing 
the value. To do this we write this action of the selection operator as follows. Take a function 
f = f 3 <E) f 1 <E) f 2 , where f 1 is a function of one variable, the one on which selection acts and 
f 2 , f 3 of the remaining variable ordered in the order of the corresponding partition elements in the 
ordered particle system. Write: 



where 1 stands for (111), in general (11 • • • 1) with M ones, which integrates to 1 for any probability 
measure. This means that we have inserted a new variable in both summands. This allows us then 
to associate factors in the two summands after the selection jump one to one starting from the 
initial factor and ending with the last born variable. 
To formalize the above ideas we need two changes. 

(1) We want to view the function J r f ++ not just as a function but to add as part of its description 
some additional information on its form. Namely, we want to consider a sum of products of 
indicators where each factor is associated with a particular particle in r\ and the particles are 
ordered in a certain way related to their appearance as selection acts. 

(2) This means we can enrich the to a marked tableau whose rows correspond to summands 
and columns to factors (which are indicators) each of which is a function of one variable. 

For this purpose we need to order the factors and to introduce new factors of 1 where necessary 
in order that all the summands in the expression for J r t ++ consist of an equal number of factors. 
This leads to a collection of factors which are ordered, carry a location and are organized in 
summands. This object we will call J r t ++,< . In the next two steps we introduce this new process 
rigorously. 

Step 3 State space of the ordered dual J r ^ + ' < 

Next we introduce the ingredients necessary to construct the modified dual, J r t ++ ' < formally. 
This means that we add to the particle system a further function assigning each particle a rank, 
an element in N which defines automatically an additional order relation. (Note that we use here 
a different order than in Subsections 12.1112731 ) 

Start by observing how selection acts in order to see how the order must be set up. An initial 
set of factors f 1 ® f 2 ■ ■ . ® /" each corresponding to one single variable is given a linear order 
(from left to right). The new particles appear by birth naturally ordered in time. However we now 



(3.8) 



/-Ha-/®1 + /®(1-1a), 



(3.9) 



/ -> f 3 ® (1 A ■ f 1 ) ® 1 ® f 2 + f ® f 1 ® (1 - 1a) ® / 2 , 
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associate with a factor a rank which introduces then a new order by rank. The rank is defined 
dynamically as follows. 

Definition 3.1 (Order relation among factors, ranks, transition under selection) 

(a) We start with giving rank 1, • ■ ■ ,Nq to the No initial particles and the associated factors. 
New factors are created by the selection operator as follows. 

Suppose there is a birth by selection operating on the j-th particle in rj and the corresponding 
factor is in the i-th rank andm denotes the current number of partition elements. That is, selection 
hits the factor with rank £ and produces the transition of f 1 ® ■ ■ ■ ® f m to: 

f 1 ®--- f- 1 ® (fl A ) ® 1 ® f +1 ®---®f m 
^ 3 ' 10 ^ +/ 1 ® • • • f 1 " 1 O /* ® (1 - l A ) ® f +1 ®---®f m . 

In order to work with an ordered object we use the rule that the additional factor 1 is placed directly 
to the right of the factor on which the operator is acting and the remaining factors are shifted one 
unit to the right while 1 and 1 — 1a o.re put at the (£ + l)-th position. 

(b ) We denote this order of factors of one variable by 

(3.11) <, that is, f 1 < f 

means that the factor f 1 lies to the left of the factor f 2 in this order. 

(c) We shall relabel all factors counting left to right and we assign the factors the 

(3.12) ranks 1,2,3,- •• 

with the natural order on N. □ 

Note that in this labelling the order in rank is inherited. Therefore in Q3.4[) each of the X^=i kg(i,j, t) 
many factors has a rank in the linear order. This means that for each summand the factors are 
assigned ordered by their ranks. This leads then to the following description. 

Definition 3.2 (State description of the ordered dual (77', J r t ++,< )) 

(a) The state of the ordered dual has the form of a marked tableau, where the tableau is given 

by 

(3.13) {<pi(h); i = l,---,N t ; k = 1, ■ • • , N t }, 

where rows corresponding to the index i represent summands, and columns corresponding to the 
index k represent the factors of a given rank in different summands furthermore every column is 
assigned a location in {1, • ■ ■ , N} by the ipi as well as the type of factor in position (i, k) in the 
array and assigning the rank to dual particles. 

A row of the tableau can be split into a product of a set of sub-products of factors, one sub- 
product for each occupied location. 

We denote ( a factor is a function I — > K + ) 

(3.14) F = the set of possible factors. 

(b) Then the state at time t can be uniquely described by a collection of 

(3.15) Nt summands (rows) of N t factors (columns) 
and the state is denoted 



(3.16) ie{l,--,N t }}, 
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where each summand is associated to a map 

(3.17) ifi-. {l,--.,JV t }— >Fx{l,-..,iV}xN 

specifying separately for each summand for each factor the type of the factor, the location in 
geographic space and finally rank in the order called < . We write 

(3.18) <Pi(k) = (f k ,j k ,£ k ) , fc = l,...,JV tj 
with the constraints that 

(3.19) f k = fk> and j k = j k > if i k = 4'- 
We set 

(3.20) J-+ + '< = {ifi(k) = (fa,j k ,h) ■■ k < N t , i = 1, . . ., N t }. 

(c) At time we set Nq = 1 and the particles {1, . . . , Nq\ are assigned an initial index and the 
particles {1, . . . , Nt — Nq} are indexed in order of their birth times, the first birth assigned index 
Nq + 1, etc. The rank assigned to a particle changes dynamically due to coalescence or other births 
to be described by the dynamics below after fr3.27\ ). 

(d) The set of particles assigned the same rank corresponds to the partition elements defined in 
Subsection \2.1\ As in Subsection \2.1\ the set of partition elements at time t is denoted 

(3.21) 7r t = (7r t (l) J .-- J 7r t (|7r t |)) 

and the partition elements have the form 

(3.22) n t (e) = {k:j k =j,£ k =e}, i = 1, . . . , \ir t \, 
where the partition element £ is located at site 

(3.23) iS)=3- 

Then ir t defines a mapping 

(3.24) 7r t :{l,...,JV t }->{l ) ...,|7r t |} 

where the partition elements are now indexed by the smallest index of the particles they contain 
and defines a mapping (giving the locations of partition elements) 

(3.25) i t : {l,...> t |}^{l,...,iV}. 

We define the enriched (from n) particle system 

(3.26) v't = {Nt,TTt,it,^t), 

that is, n' t given by the set of particles, their partition structure, locations together with the list of 
current ranks 5Rt of individuals. □ 



Step 4: Dynamics of ordered dual process (r( ,T t ' ). 

In order to clarify the ideas we start in (i) with an informal description and then we formalize 
this in (ii) writing down a Markov jump process by specifying the state, transitions and their rates. 

(i) The point will be to place the factors in the summands of J r t ++,< in a particular order 
useful for analysing the resulting expressions. We describe now informally the transitions of the 
dual ordered in this way. 
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The dual evolution can be described as follows. First the particle system r\. Starting with Nq 
initial dual particles each located at one of the sites in {1, . . . , N} at time 0, new particles are 
created by a pure birth process with birth rate sm when the current number of dual particles is 
m. More precisely, independently each particle gives birth at rate s. Each particle is located at 
one of the points in {1, . . . , N} and is assigned in addition a rank. Then the enriched evolution 
with the transitions by coalescence birth (selection) and mutation is as follows. 

Coalescence (two particles at the same location) effectively decreases the number of factors 
even though here we keep formally the number of factors and introduce an additional factor 1. We 
make the convention that upon coalescence of two factors we replace the one with lower rank by 
the product of the two factors and the one with upper rank becomes (111) (this is the analogue 
of the wellknown "look-down" process). In formulas, at the coalescence of individuals i' and k' in 
the particle system rj which corresponds in a particular summand to factors with ranks i and k (in 
the labelling (I3.12jl ) we get for the function- valued part the transition given by 

(3.27) f 1 g> • • • <g> f <g> • • • <g> f k ® ► f 1 <g> • • • ® (/'/*) ® • • • ® 1 <8 • • • 

with each factor we associate a rank and the new factor 1 gets the same rank which introduces 
then a new order by rank. The rank is defined as in a selection operation the newly created factor. 

Mutations act as before on factors corresponding to each variable independently and we note 
that in particular acting on (111) or (000) has no effect. 

Selection creates a new particle in the dual particle system n and at the same time the transition 
given in (|3.8[> occurs. 

The rank allows us to consider for a selection transition the new factors now next to each other, 
i.e. we have if the factor / gives birth now after the transition the sum of the two factors: 

(3.28) /®(1-U). 

(Recall 1a and 1 — 1a are connected with different variables). 

Ranks change at the same time as follows. The offspring of a particle of rank k is assigned the 
rank k + 1 and the ranks of all particles with ranks £ > k + 1 are reassigned rank £ + 1. We denote 
the resulting number of particles in the dual system rj t at time t by N t where N t — Nq denotes the 
number of births. Moreover at a birth time the offspring of a particle is located at the same site 
as the parent. 

To complete the description of the dual we must define the corresponding function J r t ++ ' < 
which is a function of Nt variables but has the form of a sum of ordered products of factors. The 
transitions of 7 r t ++,< are as listed above. 

We can now represent the dual expression J-^~ + via our tableau. By construction we have: 

Lemma 3.3 (Representation via tableau) 

Consider the dual process (j]' t , J~t < )t>o- Let pr denote the map associating with rj' the triple 
arising by ignoring the rank. Given the tableau associated with {rj',J- ++ ) as {ifi{k) = (fk,jk,^k) ■ 
k < N, i = 1, . . . , N}, we define for this state 

at M 

(3.29) ( V ,J*+) = (prO/). EII IK%W (/*(«*))))• 

»=1 j£S £=1 

Then we obtain a version of (j] t , J r t ++ )f>o- D 
Remark 29 Note that we can also construct a version of (rf t , J-^ +,< ) from the path (rj s , J r s f+ ) s < t . 
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It is often convenient to order the summands in the tableau in a specific way, i.e. instead of 
a set as in (|3.16l) we want to use a tupcl. Recall that the action of selection (corresponding to a 
birth event for r] t ) on a function / at a given location and rank is defined by (|3.28l) . This replaces 
a summand by two summands (one of which can be 0) in J r++-< and we must keep track of the 
summands. 

In order to keep track of the summands and produce a convenient tableau we adopt the following 
convention for ordering the summands. Starting with a single factor / operated on by selection as 
in (|3.28p we produce the ordered rows 

In general 

• The operation of selection at a rank is applied successively to each row starting at the top 
and then moving down to each original row. When it acts on the rank at a row it produces 
an additional row immediately below the row on which it acts. If it produces a row with a 
zero factor this row is removed. 

• Mutation and coalescence can produce a row with a zero factor which is then removed but 
otherwise they do not change the order of the rows. 

This means that we want to think of the 

, . state of (77', J- ++ ' < ) as a tableau where rows correspond to summands and columns to 

variables and marks on the entries indicate the location of the factor. 

The ordered summands allow to put the entries 1 in convenient positions as we shall see later on 
working with this object. 

Remark 30 We will denote this object, misusing notation a bit, again with (r)' t , )t>o and 

use whatever appearance is most convenient. 

Remark 31 A natural way to keep track of the summands is as the set of leaves in a tree. When 
selection event occurs at a vertex two additional edges and vertices are produced in the tree. The 
dual representation will then involve an expectation over the summands indexed by the set of leaves 
in the tree. This means that only the marginal distributions of the summands are involved, not the 
joint distribution of the summands. One possible choice for the dynamic is that different summands 
evolve independently. But then we must have a separate birth process ( associated to the selection 
events) for each summand. Instead in the construction below we have a single birth process but 
this process is applied simultaneously to all summands following the dynamics described below. 

(ii) Based on the ideas introduced above we now formally define the dual process (r/', J-^ +,< )) 
in the case of M types at the lower level and one type at the higher level by specifying transitions 
and transition rates. 



We next define the dynamic of the ordered dual first in 77' and then in J- ++,< , however this is 
only the formal version of what we described in point (i). 

We specify the transitions that result in an increase or decrease of the number of factors, namely, 
selection and coalescence at one site. This will produce an enriched version of the birth and death 
process used in the case M = 1, N t . We now describe the transitions in the dynamic step by step, 
first for 77' in (1) and (2) and then for J r++,< in (3). 
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(1) The pair (N t ,ir t ) of processes is defined as follows. N t is a pure birth process with birth 
rate at time t 

(3.32) fl.|7r t | 

and \wt\ is a birth and death process. 

We label the particles in the order explained above, that is, the offspring which results when 
selection acts on rank £ is placed in the linear order at rank £ + 1 and the ranks of particles to the 
right arc shifted one place to the right, that is, their rank is increased by 1. Each new particle 
produced when there are n particles is the offspring of a randomly chosen rank £ which is chosen 
with the uniform measure on 1, . . . , n. 

(2) The dynamics of the ranks is as follows. Deaths of ranks at a site are due to coalescence 
and migration and occur at rate 

(3.33) d I _ ) l m >2 4- cml m >i, (l m >i for N — oo in the latter). 



On coalescence of ranks £ and I' > I which are located at the same site the particles with rank 
£' are removed and the other partition clement at rank £ now includes all particles that had been 
in the two corresponding partition elements. 

When a death occurs, that is a rank is removed from a site, it is a migration with probability 

(3.34) m>i (l m >i for N = oo with the above convention). 

cml m >i + d\ 2 

The rank of a migrant is chosen with uniform measure on 1, . . . , |-7r t |. 

Remark 32 (a) The ranks {£,£'), £' > £ involved in a coalescence are chosen with uniform mea- 
sure on the set of m(m — l)/2 pairs of particles. Note that when m = 1 deaths due to coalescence 
do not occur. 

(b) Also we can adopt the convention that deaths at a site where m = 1 due to migration do 
not occur in the case of infinitely many sites and initial measures of the form /tt® N ; that is, case 
we adopt the convention that ranks at singly occupied sites do not migrate. Note that we cannot 
use this convention in the collision regime. 

(3) Turn now to the transition of J r++,< . 

We first give the description of the transition that results from selection operator associated to 
1a acting on a rank I at a site j. We first consider the action on just the i-th summand, namely 
that this summand is modified and a new summand is produced we denote by V and we describe 
below how to label the summands and the newly created ones. The original summand is modified 
as follows: 

{<Pi(k) = (fk,jk,tk)}k=i,...,N., as original state, 
number of particles changes: 

-4 N t + 1, 
function change to: 
<Pi(k) = (fk,jk,h), if4<^, 
Vi(JV. + l) = {l,j h ,£+l), 
<Pi(k) = (fk,jk,lk + l), Hh>t, 
Vi(k) = (l A fk,jk,lk), i££k = t 



(3.35) 
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Moreover for every summand a new offspring summand ipi> (if we use the special order of summands 
from (|3.30[) . i' = i + 1 and the number of all other higher summands is shifted by one) is produced 
which is defined as follows: 



(3.36) 



{w(*) = (/*.J*.4)}fc=i....^.+i 
is defined by 

Vi'(k) = (f k ,j k ,t k ), ai k <e, 
w(fc) = (/ fe! i fe ,4 + i), iu k >e, 
fi'(k) = (fk,j k ,l k ), xtk = t 
<p v (N, + 1) = (1 - l A ,j k ,£ + l), if 4 = L 



The transition due to coalescence of ranks I and £' > I at a site j is given as follows: 

{pj(fc) = {fk,jk,h)}k=l,...,N, 

changes to 

<P%{k) = (fk,jk,h), if 4 < £, 
(3.37) Vi(k) = (fff t ,,j k ,l k ), \il k =£, 

fi(k) = (f k Jk,h), ii£<£ k <i', 
Vi{k) = (fk,jkJk-l), ifh>£', 



Next we consider the transition that occurs if rank £ migrates to a new site say j': 
(3.38) 



{<Pi(k) = (/fc,ife,4)}fc=i,...,JV. 
changes to 

Vi(k) = (fk,jkJk), Mi,**, 
Vi(k) = (fk,j',£), \i£ k = L 



The collection of summands has a natural tree structure which is described as follows. We 
begin with 

(3.39) Nq = 1 and give this index 1. 
We index 

(3.40) the offspring of a summand of index i by il, i2, i3, . . . 

which then defines a natural tree structure. In this way we see that after N t — Nq = n births, there 
are 2™ summands. However we note that some summands can be so that this provides an upper 
bound on the set of non-zero summands. 



Remark 33 Assume we can work with the collision-free regime (N — oo). We can construct 
a richer labelling system that incorporates both the birth order and rank in the linear order, as 
follows. 

It can be encoded starting with the founder at 0. The history of transitions up to time t and 
therefore the state at time t is determined by a sequence {0 —>,...} of transitions, where each 
transition is given by one of 

(3.41) k ->•, <- £{ where £ > k) and k \ . 

The first corresponds to a selection event and produces an offspring at k+l, the second a coalescence 
(of £ with k ) and the third an emigration, i. e. migration to an new unoccupied site. Note that the 
founder position never migrates. We denote the position of the kth particle born in the linear 
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order at time t by v{k,t). We can have k <— I only if v(i) > k, that we, have a lookdown. The 
relative order of the particles does not change but when a lookdown occurs we denote the outcome 
as the "compound particle" [k,£], etc. 
For example 

(3.42) 0,0^,0^,1^,0^,1^,2^3 
would result in (the labels according to birth times) 

(3.43) 0, 01, 021, 0213, 04213, 042153, 04[2, 3]15. 

Note that in the special case with no coalescence this labelling above identifies the tree associated 
to a branching process and in the special case with no births identifies the coalescent graph (via 
the lookdown process). Therefore this can be viewed as a coding of the branching coalescing (or 
selection-mutation) graph. 

The following observation for the enriched process is important. Since the birth and death 
process is recurrent, when the number of particles at a given location returns to one, the sequence 
can be deleted and relabelled and the process begins again. 



Step 5 (Summarizing the construction) 

We can summarize the construction so far as follows. We have modified the (rj t , J-" t ++ )t>o to 
a new process (r]' t , J r t ++ ' < ) t > where (^)t>o is described by the enriched (by information on the 
rank) birth and death process (branching coalescing particle system) and the function- valued part 
is given by 

N t 

(3.44) (^ ++ '<) t > , J- t ++ '<=E^ +,< 

i=l 

where N t denotes the number of summands and the (possibly also ordered) summands are ordered 
products of factors. 

The tableau provides a nice way to to represent the state of J r t ++,< which is an array of strings 
of factors called rows. The columns in this array have a nice structure and correspond to factors 
with the same rank. For that purpose we had written at each selection event the new rows directly 
under the parent row. (See (|3.73[) for example). 

For convenience we denote the tableau associated to the object J r t ++ ' < as 

(3.45) t((^,J-+ + <<)), 

where a column corresponds to a variable and a row to a particular summand. 

Definition 3.4 (Tableau-valued process) 

The evolution of the Markov jump process (n' t ,J r ^ + < )t>Q induces a marked tableau-valued pure 
jump process driven by an autonomous particle process r]' t : 

(3.46) (t((^,J- t ++ -<))) t >o. □ 



We have for the modified process according to the Remark 14.131 the duality relation: 

Proposition 3.5 (Modified duality ) 

We have abbreviating the initial product measure state. 

(3.47) E[ f T+ + dx®] =E[[ J? + '<dx 9 ]. □ 
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The main idea for the analysis of (?/, J r++,< ) will be to complete the specification of the dynamic 
in such a way that the duality remains true for the new modified function- valued dynamic as well 
as getting cancelations of terms by coupling the summands for different i in the representation in 
(|3.4p by obtaining sums combining to give factors 1. What is in the way is that upon selection 
1a and 1 — 1a have different ranks. We defined in Section [5] a new process denoted by (£? t ++ )t>o 
and we define the enriched ({? t ++,< )t>o precisely in the next Subsubsection [3~3l by specifying the 
mutation and selection transitions such that they occur simultaneously at the same rank in each 
summand. 

Step 6 Simplifying the ordered dual to (?/,.7j + ). 

Our main application of the dual is to determine the emergence time scale. For this reason 
we make a few simplifications that are applicable for the calculations of the moments of the type 
M + 1 when we begin with an initial measure on (Am) n of the form 

(3.48) fj,® N with fj,(M + 1) = 0. 

Note that for the moment we have included summands that when integrated against /j,® n yield 

0, hence the dynamic can be further simplified if we are interested only in the integral on the r.h.s. 
of (|3.53[) . We observe that a summand which contains a factor 

(3.49) (0,...,0,*), with * = or 1 

becomes when integrated w.r.t. fi since we start the original process with all the mass on types 

1, • ■ • , M so that a summand with such a factor would not contribute taking the expectation. 
Therefore we can and shall agree to prune the summands in the dual process by setting 

(3.50) J^ t + i + ' < = 0, summand i contains a factor (0, . . . , 0, *) 

T+t ' < = Ft,? ' < . otherwise. 

With this convention we have to deal only with summands which have a contribution in the integral 
in (I3~47l) . 

Similarly the evaluation of ranks for which / = (1, . . . , 1, 1) in all summands does not change 
the integral on the r.h.s. of (|3.47p . However for the moment we do keep factors which are effectively 
one, i.e. (1, . . . , 1, 1) since below we shall see that this will help doing the bookkeeping once later 
on we couple summands. 

Furthermore note that the factor (0,. . . ,0,1) can only gain further l"s by a downward mutation 
and at the moment we consider rr^down = 0. (Even for rr^down > this would at best become 
visible if the dual process reaches a state where we have O(N) summands with such a factor). 

Lemma 3.6 (Simplified duality with ordered factors) For initial measures satisfying /i(A/ + l) = 0, 
we have the dual representation in terms of the simplified dual: 

(3.51) E\ j K + d(^)] =E[f Ft^di^)}. □ 

3.3 Modification of the dual 2: Q ++ !< with ordered coupled summands 

In the dual (?/, J- ++ - < ) factors and summands are ordered and part of the state and summands 
are coupled, since transitions in the particle system induce transitions in J-? +,< , which occur 
for certain factors in all the summands. The strategy of this section is to to use the new coupling 
between summands used by Q ++ , that took advantage of the structure of this dual which introduces 
cancellations which will make combinations of some summands in one variable equal to 1. This 
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can be generated using the process Q ++ in combination with the enriched particle system rf and 
the order introduced for J r t ++,< to arrive at 

(3.52) (r& $++•<). 

This construction will be the basis for the second key idea that will be taken up in the next 
Subsection 13.41 which is to introduce another dual object that allows us to represent the process 
with coupled summands as a nice set-valued process which allows to study the longtime behaviour. 
In our application we can also calculate moments and with enrichments of it we evaluate the 
Malthusian parameter /? that determines the critical time scale. 

The objective is to use the basic duality formula 



(3.53) E[ 



T+ + d{x(t)r\ = E[ I J? + d(x(0))®} = E 



++,< 



d(/x®) 



where J r t ++ is the refined dual process of Subsection 12.51 and to rewrite the r.h.s. further, in the 
sense that for every initial state x®: 



(3.54) E 



++,< 



d{x®) 



= E 



Step 1 (Construction of the new couplings of summands in the ordered dual) 
We obtain the new coupling of the summands on the level of the tableau- valued process by 
changing the rule of the selection transition, parallel to the change in Section |2] from F ++ to Q ++ , 
as follows. The action of selection operator acting on a particle with rank t and associated factor 
/ and location j produces now at j and the ranks t and i + 1 the factors induced by 

(3.55) /— »• W (8 1 + (1 - U) ® /• 

This means that now the factors 1a and (1 — 1^) have the same variable, i.e. they are in the 
same column and the same rank. In particular this means that under the further action of the 
migration, coalescence or the selection operator wc apply this simultaneously to the (1 — 1a) and 
1a in the product (1 — 1a) <8> / and 1a- f ® 1, that is, to every factor in the corresponding rank (in 
the linear order). This includes the transitions of these two factors in the two summands arising 
from coalescence, migration, selection and mutation, both normal and rare but for the moment we 
do not consider the rare mutation. 

Note that at this moment the variables of 1a and (1 — 1a) are associated by this modified 
mechanism, even though originally they where associated with different individuals in rj. The 
point is that taking the expectation of the sum w.r.t the dual dynamics we have not changed the 
value of the expectation since the expectation depends only on the marginals corresponding to the 
summands and not the joint law. This means that wc permute the order of the individuals in the 
?/-proccss depending on the summand, in order to make the cancellation effects explicitly visible. 



Remark 34 Note that 
changed. 



have the same type of state as J r 4 f+ ' < but the dynamic is now 



Remark 35 A key point is that with this new coupling the different (i.e. not repetitions of another 
one) non-zero summands always correspond to disjoint events. Also note that some summands can 
correspond to the empty event which does not contribute in the duality. 
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Remark 36 Note in the new process (Q t ' )t>o can be viewed as a process, where the birth, 
coalescence and migration transitions are now jumps associated with the ranks (recall ft3.44\ l)- 
Recall that the coupling of summands in h3.65\) was given by the evolution rule that every transition 
occuring for an individual in rj, say k of some particular rank say t occurs simultaneously in each 
summand that is for every .F i t ~ t + ' < in fr,3.44\ )- This induces a coupling of the rows in the tableau. 
Alternatively we can use rf as the driving process by permuting for each individual in rj 1 the position 
in the ranking to the factor it is associated with. 

Recall that rows which contain factors (0 ... 0) play no role in the duality we do not need them 
in any calculation and we can completely remove them which we did in passing from J r + + ' < to 
J-~l +,< . Similarly a column which has only factors (1 • • ■ 1) plays no role in the duality expression. 
We call these ranks inactive. 

Definition 3.7 (Non-zero summands and active ranks) 

(a) If we include only summands which are not 0, we number these as 

(3.56) l,...,iV t *. 

(b) Columns (ranks) in the tableau which contain only (11 ■ • • 1) are called inactive. In particular 
sites which are only colonized by inactive rows are called inactive. □ 

Definition 3.8 (The coupled dual process [rf ,G ++:< )) 

We shall denote the process arising from [rj' , t[[rj'J- ++,< ))) by introducing coupled summands 
via the rule i3. 55\) and by removing inactive ranks and zero rows by 

(3.57) [v't,Gt +>< )teR- 

The part Q^ +,< of the process consists of the collection of summands of marked factors 

(3.58) {Q++ :i=l,...,N;}, 

each consisting of iV t * active ranks and the associated factors with their location, thus specifying 
again a marked tableau of indicator functions 

(3.59) t(g++'<). □ 

A key property of the new dual is that it allows us in a more transparent way to keep track of how 
far our expression deviates from a product, which we summarize in the next remark. 

Remark 37 Recall from above that the sum can be organised with the help of a tree, a splitting 
occurring with each operation of the selection operator. Consider at time t + s the two summands 
corresponding to each of the two subtrees starting at a birth time s with selection operator (1^/ <& 
1/), respectively /® (1— 1a) (where the order of the factors is as indicated left to right). We have 
changed the dynamic such that we couple the summands such that effectively we work instead of 
this with 

(3.60) 1 <g> (1a/) and f ® (1 - l A ), 

which is then a decomposition in complementary terms since l A + [I — 1a) = 1- Furthermore 
if mutation acts on the partition A the decompositions (0,1) respectively (1,0) are traps and if the 
mutation rates are strictly positive these traps are actually reached. If the trap is reached ( we call 
this resolution) then only one of the two summands in h3. 60)) remains. 



We have the identity (which we prove in Step 2 below): 
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Proposition 3.9 (Tableau-valued duality with coupled summands) 

The process (n' ,Q ++,< ) gives the same expectation as the dual (r/, J- ++,< ) , namely: 



(3.61) E 



/(xx^W) = e jii^dip 



□ 



Remark 38 We have a look at the behaviour of sites contributing in the duality. 

In the two-type case we saw new sites being colonized by particles was always equivalent of a 
contribution of this site to the dual expression. This is not the case for the case of M > 1 and 
becomes very transparent with the new dual. Consider the sequence of transitions 

f 



(3.62) 



f-l B ®l+ 
l B c®f 



1 A 1 B ■ f ®l®l+ 
U<= ® Is • f® 1+ 
1a1b<= ® / ® 1+ 
® 1b- ® /• 

If the second rank migrates and then the first rank evolves by mutation via the jump Ia^b 1 
(hence 1a1b c , 1a c 1_b, 1a c — > 0), then the newly occupied site becomes inactive (—1). Hence we see 
that factors which have colonized a new site need not lead to a permanent colonization of that site 
as active site which is in contrast to the case M = 1. 

Note that the jump to 1 of a rank to the right of a migrating rank does not have this effect. For 
example consider 



P / • 1 B ® 1 + lj 



(3.63) 



IaIb •/ <8>1®1+ 
U<= ® 1b • /® 1+ 
U1b= ® f® 1+ 

1,4<= ®lB-®f 



Now first let the first column migrate and then the second column resolve l B — > 1. M^e t/ien obtain 



(3.64) 



Uls ■/(» 1(8)1+ 
1a=®/®1+ 
U1b= ®/<8>l 

UIb-/® 1+ 



1 



(AB)= 



fewi where now the first and second columns are located at different sites. However the first column 
is not removed. 

Step 2 (Alternative Proof of Provosition \3 .9}) . 

The statement can be proven viewing the process as enrichment of Q ++ and seeing that the 
latter satisfies the duality. We give an independent argument based on 7 r++,< . 
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We have to recall how we rewrote the state of J r t ++,< (compare (|3.44j) and (|3.67[) below). 
Consider the finite sum decomposition from (|3.44l) 



(3.65) J- t ++ ' < = E-^ + ' < ' 



where each Ff. t + is an (ordered) product of indicator functions. 

Given a realization of the particle system, we will build a version of F£ + ' K . Then in (15351) we 
can write for the r.h.s.: 



(3.66) E[f {^ + ' < }d(^)}=E^J{E^+}d(^) 























4 = 1 





where E denotes the expectation over the number of summands process (N t )t>a and E?j t = E{-\N S : 
< s < t) the respective conditional expectation with respect to the mutation, coalescence and 
migration dynamics. 

As in the case M = 2 we have that J 7 ^ K is decomposed into a sum and then 



(3.67) E 



We note that we have the following upper bound on the number of non-zero summands iV t * < 
N t < 2 Nt (since selection acting on a rank can replace each summand by at most two non-zero 
summands). We note that the sum over Nt or 7V t * results in the same integrand for J. 

Therefore given {N s : < s < t} we have to keep track of {J^ +,< : 1 < i < N*, < s < t} 
and the way they combine to form the sum. On the r.h.s. of (|3.67p we see that the conditional 
expectation depends only on the marginal distribution of the vector of summands and not on the 
joint distribution for the given birth process. This is where the coupling will come in. 

Since in the dual representation we compute the expected value of the sum this depends only 
on the marginal distributions of the dynamics (conditioned on their starting points) of the different 
summands after time s (recall (|3.67p . Therefore we can couple the evolution of the dynamics of 
the two summands differently and still preserve the duality relation. 

Observe here that the different transitions of the dual (rj't, J~^ + ' < )tem+ occur independently for 
each individual in the dual process and the dynamic of the dual system is Markov. Therefore if our 
coupling of summands is constructed by coupling transitions of factors in {J-" t ^ , i = 1, • • ■ , iV t *}, 
which correspond to different leaves i in the binary tree generated by the dual population the basic 
duality relation ()3.6f [) will be satisfied automatically. 



3.4 The set- valued dual as functional of the ordered dual 

We obtain now from (77', Q ++ ) as a functional a process Q ++ which is set- valued and turns out to 
have a Markovian dynamic which we exhibit. 

3.5 Dual Q ++ : Examples for the special cases M = 2,3 

To understand how this coupling construction of Subsubscction 13.31 might be represented as set- 
valued process we consider the example of M = 2, 3 and calculate what form the transitions take 
in Q^ +,< and indicate how we can represent the resulting process which we call Q ++ again in form 
of a functional of a Markov process and use this process as the new dual. These examples also 
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suggest the concept of resolved and unresolved factors, which is a key to the growth of the number 
of factor in the dual expression. 
The case M = 2 

In the case M = 2 we construct a Markov process with dynamics looking as follows. There 
are a finite number of types of factors, namely (100), (010), (110), (111). Starting with a factor 
of type (110) at a tagged site, new factors and a new summand are produced once a birth in rf 
occurs by the action of the selection operator. Factors can move to a new site, can coalesce or 
change due to mutation. The mutation can result in the removal of a summand as we shall see 
below, a phenomenon not occurring with one type on the lower level. We shall show below that the 
factor-valued process associated with a given site is recurrent and returns either to the state (110) 
after a finite random time or to (000) and the whole summand is deleted. During this sojourn 
some factors emigrate to new sites and the founding factor at the new site now (in contrast to the 
case M = 1) may become inactive and thus a new site can be lost again. The essential question is 
the rate of the successful emigration (establishing a new permanent site) which governs the growth 
of the number of the total number of (active) factors. 

Here are the transitions in detail. 

• Selection We first consider selection. Recall that the basic selection operator in Q ++ is 
given by 

(3.68) / -)■ (1 A • /) ® (111) + (1 - 1 A ) <g> / 

where 1a = (011) and the terms 1a ■ / and (1 — 1^) have the same rank in the order so that 
all other operations are performed simultaneously on them. Since 1a, 1 — 1a are of the form 
(011) respectively (100) they are not traps under the mutation chain (different from (110) 
disregarding rare mutation) and can be changed to (00*) or (11*), which are quasi-traps. We 
will therefore introduce the following concept: 

(3.69) (01*), (10*) are called unresolved factors. 

• Mutation We now include the action of mutation at rate (mi2 + 77121) which can resolve 
factors, i.e. a transition of the selection factors (011) and (100) to the quasi-trap (110) or to 
the quasi-trap (001) occurs: 

(3.70) (100) -> (110), (010) -> (000) with probability — , 

77112 + m 2 l 

or 

772 1 9 

(3.71) (100) -> (000), (010) -> (110) with probability , 

77112 + 77121 

(thanks to the coupling we use). This means that one summand is zero and the other has 
only the factor (110), which is a quasi-trap. Factors where the mutation has occurred are 
called "resolved factors" . These factors are important because they don't undergo further 
changes by mutation (except rare mutation), where with unresolved factors in a row we do 
not know yet if this summand remains or will become zero due to the action of mutation. 

• Coalescence Now consider the coalescence of rank ki and fe. If a rank ki looks-down to 
a position k\ < k2, then the column ki is removed and the rows in which the combination 
(010), (100) or (100), (010) occur are removed (remember that here we are taking the product 
of indicator functions). Note that if subsequently the resolution (100) — > (000), (010) — > 
(110) occurs at the rank k%, then the offspring of both original ranks are deleted. 
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• Migration Any factor can undergo migration, that is, move to a new site, but factors of the 
same rank do this together. For the moment we consider the case of infinitely many sites. In 
this case a rank always moves to an unoccupied site (but note that the rank of the particles 
it contains does not change). 

The importance of the location is that factors at different sites do not coalesce. 

We now make a number of observations being crucial in the sequel. 

Note that migrating factors can be one of three types (referred to as *-types) 

(3.72) (110), (100)*, (010)*, 

where for example (100)* denotes a factor that is currently given as factor (100) later on the 
resolution (100) — > (110) occurs, which makes the summand with (010) equal to zero and the one 
with (100) (which became (110)) acts as the original factor (110). Hence the * indicates that the 
factor has not yet resolved in (110) or (000). 

Note that we can decide on the final type of a factor already at the time of migration with 
probabilities ( — — ^ — ), ( — ^-k 2 — V Note furthermore that the choice of the final type also 
determines the fate of the particles to the right of that rank. 

Note that it is possible for a newly occupied site to be deleted if the founding rank is deleted 
due to resolution (100) — > (111) of an ancestral position (recall Remark l38j) . In this case the 
descending newborn factors then have no effect anymore, either a summand disappears or does 
not change the founding father. Hence we have to keep track of the ancestral relations in order to 
determine which factors need to be deleted! 

How do these transitions combine? Consider some examples. 

Three successive applications of the selection operator 1,4 = (011) to a factor /, i.e. successively 
to rank 1 can be described in the form of a tableau with 8 rows and four columns. Deleting now 
the zero rows we end up with the simplified tableau of only four rows as follows: 



(3.73) 







01 




01 


01 


(i- 




®\\ 


•/ 


01 


01 


(i- 




0(1 - 




®iW 


01 


(i- 


-A) 


0(1 - 




®(i-ii) 


0/ 



Here the first column corresponds to the initial position of / and the "offspring" of a selection 
operation is placed immediately to its right. Each column consists of indicator functions which 
induces a partition of the type space and the partition elements evolve via the mutation process 
but in such a way that the evolving partition elements at all time remain disjoint. 

Applied to the case / = (110), 1a = (011), (|3.73p yields the following tableau of indicators 
corresponding to subsets and inducing a partition of {1,2, 3} 4 : 



(010) 


01 


01 


01 


(100) 


0(010) 


01 


01 


(100) 


0(100) 


0(010) 


01 


(100) 


0(100) 


0(100) 


0(110). 



Consider the case in which at the second rank in ()3.74j) . we have (100) — > (000), (010) — > (110), 
by mutation. Then there are only two summands and the third and fourth ranks are inactive 
(and therefore columns) are deleted. This deletion of formerly active ranks due to mutation is the 
reason that the growth is slower than in the two type case. 

Associated disjoint decomposition of product of type set. 

In (|3.74|) each row corresponds to one summand in the dual Q ++ and each column defines a 
decomposition of the set of the M low level types. The collection of columns defines a decomposition 
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of the 4-fold product of the set of low-level types. The number of factors is a pure birth process 
with rate s as in the two- type case. 

More generally each state of the (k' x fc)-tablcau induces a decomposition of the k' -th product 
of the set of low level types, i.e. {1, 2} k and thereby also of the set {1, 2, 3} fc . The collection of 
rows of the resulting tableau corresponds for (|3.74p to the decomposition of {1,2} 4 given by the 
elements {x\, x 2 , x 3 , X4) G {1,2} 4 : 

{ Xl G {2}} 

( o 7 ^ u{xr e {i}}n{x 2 e {2}} 

1 ' u{ Xl g {l}} n {x 2 g {l}} n {x 3 G {2}} 

u{ Xl g {1}} n {x 2 g {1}} n {x 3 g {1}} n {x 4 g {i, 2}}. 

The case M = 3 

We next consider the effect of having M > 2 lower level types using the case M = 3 exhibiting 
the key features. The main new feature is that with M = 2 we had only one selection operator, 
since we had two fitness levels. In general M — 1 of the M lower level types have different fitness 
levels from 1. Having more such levels results in more possible ways for unresolved factors to 
reach their final resolved state in a quasi-trap called above *-typcs. Hence the essential difference 
between the case M = 2 and M = 3 is the resolution mechanism for the dual process which we 
shall describe in the case M = 3. 

Suppose the fitness of the four types is 

(3.76) 0, a, 1,1 with a E (0,1). 

Recall the upper level type has downward mutation at rate iV -1 while the three others have 
mutation rates 0(1). 

We need to analyse the dual starting with one particle and with: 

(3.77) g+ +!< = (1110). 

Most of the things said for AI = 2 remain but what we have to study in addition is what 
happens after the creation of an unresolved factor by the action of a selection operator, i.e. a factor 
not of the form (111*) or (000*) which are the quasi-traps, i.e. traps under mutation, excluding rare 
mutation. On this unresolved factors various actions can take place, mutations, further selection, 
coalescence and migration. The new feature is that the selection can act at different levels and can 
be combined with intermediate mutation steps. Consequently the process of resolution is no longer 
a 2 state Markov chain (with states (10*) or (01*), but requires paths of intermediate steps. 

We focus on this feature now and look at the transitions of resolved and unresolved factors step 
by step. 

• Coalescence and migration. This is exactly as what we described in the case M = 2. 

• Selection and mutation 

We focus first on the selection operator followed by subsequent mutations. We first note that 
we have two selection operators corresponding to the sets of types 

(3.78) A 2 ( fitness > a) and A 3 ( fitness > 1), 

which leads at rate s to the multiplication by (0111) resp. (1000) and by (0011) respectively 
(1100). Recall the choice between A 2 and A 3 occurs with probability a respectively (1 — a). 
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In particular we have for the selection operator with A = A3: 

(3.79) (1110) (0010) ® (1111) + (1100) ® (1110) at rates s • (1 - a). 

After a finite random time due to the action of mutation this yields either (1110) ® (1110) 
(mutation from type 3 to type 1 or 2) or (1110) ® (1111) (mutation from type 2 to type 3 
and then type 1 to 3) and this selection event is then in either case resolved since we arrive 
in a quasi-trap under mutation. 

If we use the selection operator A = A2 we have the transition: 

(3.80) (1110) — > (0110) (g> (1111) + (1000) ® (1110) at rates s- a. 

In other words, selection occurs at rate s and then the selection operator (0111) is chosen 
with probability a and the selection operator (0011) is chosen with probability 1 — a. 

Using the coupling between summands in Q ++ , eventually after a finite random time and 
several subsequent mutation steps (for example 3 — > 1, 2 —> 1 for the (0110) factor) the part 
of the sum due to the considered first action of the selection operator collapses by mutations 
to either (0000) or alternatively to (1110) (g> (1111) or (1110) ® (1110), and the factor on the 
r.h.s. of (|3.80p is resolved. 

What makes this dynamic difficult to handle? The problem is to determine what happens 
between creation of an unresolved factor and its resolution by subsequent mutation steps. As 
before when a position migrates we can decide in advance, i.e. at creation, which state will occur 
at resolution. But now to determine the probabilities of resolution in one of the possible states, 
namely, the two quasi-traps (111*) and (000*), we need to follow the intermediate states which 
were not present for M = 2. 

To do this we must work with a finite state Markov chain conditioned to be absorbed by one 
of the two absorbing states. Having first chosen this outcome we can then secondly replace the 
mutation Markov chain by its h-transform corresponding to the condition to reach the appropriate 
absorbing point. The overall effect of this does not change the law of the resolution process. 

For example the r.h.s. in (|3.80[) can resolve to 

(3.81) (1110) <g> (1111) + (0000) mutations 1 -> 3 or 1 -> 2. 
Similarly r.h.s. of (|3. 801) can resolve to 

(3.82) (000*) + (1110) <g) (1110) by mutations 2 ->• 1, 3 ->• 1, etc.. 

The length of such paths to resolution is in principle unbounded but is eventually absorbed in 
(000*) or (111*) and can pass to a finite number of intermediate states. 

In other words between birth or coalescence events each rank involves the dynamics of a partition 
of the set of types {1,2,3}. Moreover each partition element is associated to a subset of the set 
of columns of the tableau of factors in the different summands. The dynamics of the partition 
associated with a rank is given by a Markov chain induced by the mutation process. Note that 
the number of partition elements can only increase by a selection operation and can only decrease 
by coalescence in each case by one clement. Otherwise the partition process eventually ends in a 
quasi-trap and stays there until the next selection event at this rank. 

Note that a founding particle is not influenced by coalescence at that new rank and its behaviour 
is given by a Markov chain involving mutation and selection at that rank by the look-down principle 
in carrying out the coalescence (this uses of course the symmetry between individuals under the 
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resampling mechanism). We can use this as in the M = 1 case to obtain a simpler form of the sum. 
Namely we can determine its resolved value in advance and then follow the h-transformcd process 
as in the M = 2 case. As before we distinguish for each of the selection operators two versions 
associated with the two principal states for absorption but now instead of making the choice with 
probabilities 7711,2/(^1,2 + m 2,i) respectively ?712,i/(toi,2 + W2,i) we now use the two absorption 
probabilities. The states of the process before resolution is then described by the h-transform 
corresponding to the absorbing state chosen. 

Remark 39 In the new formulation of the dual process with coupled summands namely (rf,G ) 
it is important to code the new site according to whether we have a factor which survives or one 
which can be deleted after some random time. In particular in the regime where we have no collision 
we can do this easily since a site colonized will never be hit by another migrant ever, so that these 
two events do not depend on the evolution at other sites. For this purpose we shall later introduce 
classes of permanent, removed, and transient sites. A site is called permanent at the moment it 
is clear that it survives, it's called removed once it becomes deleted and transient after the time of 
birth, until it will eventually be deleted. Alternatively using the h-transform we can immediately at 
the time of colonization decide to assign permanent or transient to the site. 

Example for coalescence of selection factors. In order to get a better feeling for the 
process, we discuss a particular effect. Since there are more types we must also consider the result 
of coalescence between two different partitions. As we has seen above in the three type case it is 
necessary to determine the outcome of collisions of unresolved factors of different types. 

Now consider the case of selection operators corresponding to A = (0111), B = (0011) acting 
on two different ranks, say i < j, both with factors corresponding to (1110): 

ri*rt finnu (0110)0(1111) ri1im . (0010) ® (nil) 

(5.86) (LLW)^ lA ( 1OOO ) ( 111O)j UliUj^i B ( 11OO)0 ( 111O ^ 

Then after coalescence of the first columns sitting and after insertion of two ranks at ranks i and 
j + 1 we get the non-zero rows at rank i 

l A l B = (0110)(0010) = (0010), 

( 3 ' 84 ) (1 - 1 A )(1 - Is) = (1000)(1100) = (1000) 

U(1-1b) = (0111)(1100) = (0100). 

The following illustrates how effect of coalescing two columns followed by a mutation. Consider 
the tableau obtained starting with (1110). Then let the selection operator 1b act on column 1 twice 
and then the selection operator 1a acting on the third resulting column. This yields the tableau 
on the left in (|3.85[) . Then the coalescence of the 3rd and 1st columns results in the tableau on the 
right of p.85|) . Then mutation from 1 to 2 removes the last row and mutation from 2 to 1 removes 
the third row, resulting in the total change. 



(3.85) 



0010 


mi 


1111 


1111 




0010 


1111 


1111 


1111 


1100 


0010 


1111 


1111 




1100 


0010 


1111 


1111 


1100 


1100 


0110 


1111 


0100 


1100 


1111 


1111 


1100 


1100 


1000 


0110 




1000 


1100 


1111 


0110 



Then after resolution of the first column which would consist either of (1110) factors or (0000) 
factors, either the third or fourth rows can be deleted since after integration in the dual expression 
the result is not affected by these terms anymore. 
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We note that very long strings on the way to resolution are very unlikely since the mutation 
rates between the three lower types are strictly positive and all other rates bounded above by some 
constant, so that the quasi-traps for this mechanism are reached in finite time. We also note that 
at each step we have a finite number of possible outcomes. Therefore by Markov chain theory the 
waiting time needed to reach resolution and reach a quasi-trap has therefore finite mean. 

Conclusion from examples M = 2, 3: 

We can summarize the message of the two examples for the case of M-types on the lower level 
as follows. Again as for M = 1 we can simplify the dual expression but instead of analysing strings 
of factors we now work with tableaus of factors. In addition we have to distinguish resolved and 
unresolved factors and hence resolved and unresolved sites. In comparison with the case M = 2, 
for the general case M > 3 there are two new features in the resolution of the selection factor: 

(i) resolution does not occur at the first mutation, and 

(ii) there are a finite number of intermediate states, more precisely (2 M — 2) (here we have M lower 
level types)that can be reached by mutation before resolution. 

Nevertheless the observations on the form of Q ++,< in the two examples does suggest that our 
Markovian dynamics of marked tableaus corresponds to a decomposition of subsets of {1, • • ■ , M + 
1} N . This we will now formally introduce in Subsections I3.6ll3~7l for the case of general M. 

3.6 The set- valued duality 

We are now ready to set up formally the representation of (rf 1 Q ++ ' < ) by a marked sei-valued 
Markov process and in particular specify its state space. We make some definitions in a form, that 
it can be used in fact on any geographic space, which is countable. All the duality constructions 
we are carrying out in Subsections 13.61 - 13.71 work for every finite type space. 
Let 

(3.86) I = I M :={l,...,M,M + l}, 

(3.87) I := the algebra of subsets of I. 
Let the geographic space be S with 

(3.88) S = {1,...,N} orS = N. 

The states of (7/, (? ++,< ) can be coded as marked tableaus of columns of factors of one variable 
marked by a location, where a row defines then the product of factors. This defines a set-valued 
state 

(3.89) g+ + , 

as follows. The rows of the tableau (recall (|3 . 29[) ) correspond to the indicator function of subsets 
of (Im)" 1 for some m and the product over the marks gives then a subset of Y[ (^M) mj such that 

distinct rows correspond to disjoint subsets. Therefore the sum of the rows is the indicator function 
of a subset of Jj (Im)" 1 ' ■ It is often convenient to associate with this subset a subset of n 7 es(^M) N 

by considering 

(3.90) J](A,xI N ) , AtQ(I M ) m >. 
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Definition 3.10 (Marked set-valued process of Q ++ ) 

(a) The rows of the marked tableau l(G ++ ' < ) define disjoint subsets of 

(3.91) n^)" 4 f° r some { m i < °°hzs- 

J'GS 

The union of these disjoint subsets is a subset of Hj^s(^M) mj whose indicator function is given 
by the sum of the rows (where the factors in the row are multiplied corresponding to taking the 
intersection of the sets associated with the different ranks ) of the tableau. 

This subset ofY[j e s(^ M ) mj does not characterizes the state of (j] r ,Q ++,< ) since in general the 
ranks cannot be reconstructed from ranked subtableaus corresponding to different sites in general. 

(b) The set in A3. 89]) defines a new process (the state space will be formally introduced below) 
of subsets of Y[ (Ia/) N denoted: 

(3.92) (G+ + )t>o- □ 

Remark 40 Note that since the map giving Q ++ from the Markov process (j] f ,G ++,< ) is not 
bijective (at least if space contains more than one site, see above remark), the former need not 
be a priori Markov in a spatial situation. However we will verify the Markov property below in 
Proposition \3. 1 3\ 

We now introduce some needed notation: 

(3.93) T m ■= { the algebra of subsets of (Im)™}, 

(3.94) T := subsets of (I M ) N of the form A x (I M ) N with A e T m for some m, 

(3.95) TcT:=ff- algebra of subsets of (Im) N - 
The state space of Q ++ is contained in 

(3.96) I* = Algebra (^)Z,, where for each j, lj = T- 

jes 

Given a non-empty set G G I* let (recall we remove inactive ranks in Q ++ and hence in Q ++ 
we get only finitely many factors not equal to I) 

(3.97) \G\ := min{j : 3Sj = {s u . . . , Sj} C S : G = Gj <8 (I) s ^ with G 3 e (g) T} 

:= +oo if no such Sj exists . 

If \G\ = j < oo, the support of G, supp(G) is defined to be the set Sj that appears in ([3.9711 . 
Then we define 

Definition 3.11 (State space of Q ++ ) 

The state space of (Gt^ + )t>o * s defined as countable algebra of sets 

(3.98) / = l s := {G e /* : \G\ < oo}. □ 

Given the finite type spatial Fleming- Viot process with selection Xf = {x N (t, j)} je s S (V(Im)) s , 
we define the expression 

(3.99) X? := ]J(x(t, j)) m 6 (T((I M f)) S \ with 5 = the countable geographic space. 

jes 

The following dual representation is satisfied because of the very construction of the process 
Q ++ as functional of the dual Q ++ ' < : 
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Proposition 3.12 (Set-valued duality relation) 



(3.100) E Xo <x t ,g++> 



= E ? 



<x ,g++ > 



V5++ G / andX Q e (T{l)) s .□ 



The point in the sequel will be to formulate a Markovian dynamic which generates the process 
Q ++ in a transparent way and hence preserves the above duality relation. The Markovian dy- 
namics of the process Q^~ + is constructed in Subsubscctions l3.6.Hl3"77E rst without migration, then 
incorporating the latter. Then indeed we can prove in: 

Proposition 3.13 (Markov property of Q ++ ) 
The process (Gt~ + )t>o is Markov. □ 



In order to analyse the behaviour of the r.h.s. in ()3.100[) for our questions on emergence and 
fixation we have to classify factors of the geographic sites as follows. 

Definition 3.14 (Occupied, inactive and resolved sites) 

(a) A factor is called resolved if its indicator has the form (1, • • • , 1, *). 

(b) A site is occupied if there is at least one factor with indicator not equal to (1, . . . , 1, *) at the 
site. An occupied site is called resolved if it contains only one type of factor, namely with indicator 
(!,...,!,*), or inactive if it contains only factors with indicators (1, 1, • ■ ■ , 1). □ 



3.6.1 Preparations: Set- valued Markovian dynamics without migration 

We now have to find a mechanism which turns Q ++ into a Markov process. As a preparation for 
this analysis of the set-valued dual dynamic Gt~ + we begin by focussing on a single factor with 
subsets of I (corresponding in Q ++,< to a single column (rank)) since in this case we shall see 
that Q ++ is in fact a Markov process. The difficulty arises due to migration. First we analyse the 
behaviour of Q ++ under the action of the mutation transition at a single factor at a single location. 
Later we consider also selection and coalescence, in particular the interaction of mutation with 
selection. 

Where we had before with (77, or (n, Q ++ ) a collection of birth and death processes with 

values in N, we now get a dynamic on another countable set, which can be viewed as subsets of 
Ia/. Acting on each column we have a dynamic of subsets of 1m if j is the number of factors 
I. We study here these refined dynamics and begin by looking at the dynamics at a given factor 
I (i.e. rank), first under the effect of mutation, then only under selection and finally under the 
combination of both. At last coalescence is added in a straightforward way. At a single location 
we deal with subsets of I N as in (|3.94[) . Our subsets typically do not have product form but are 
disjoint unions of subsets of products of I. So the disjoint union induces a partition of I in each 
factor of I. Therefore we will study the dynamics of these partitions of I. We introduce migration 
in the next subsubsection. 

(1) Mutation Consider the mutation mechanism in the dual process (Gt~ + )> ignoring for the 
moment all other transitions. This induces transitions on indicators of subsets of I - see (|3.70|) . 
(I3.71|) . (|3.81|) . (|3.82j) for examples. In general, translating the transitions of indicator functions 
into transitions on sets we obtain for j G I: 

(3.101) A^AU {j} at rate ^ m o,^ A -> A \{j) at ratc X! 

teA ieA c 
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This defines a Markov process on subsets of I which we denote by (here m stands for the matrix 

( m i,j)i,j=l.---.n)'- 

(3.102) (C(t))t>o. 

We want to define now a dynamic on a whole partition of I. Given a partition of I we can let 
the mechanism in (|3.101|) act simultaneously on all the indicator functions corresponding to the 
elements of a given partition. 

Note that for our model the order O(l) transitions do not change the (M + 1) component so 
that effectively this is described by a Markov process on the set of partitions of {1, . . . , M} with 
transition and transition rates given for every k = 1, • • • , n by: 

(3.103) (4...,A,)^ (A 1 ,...,A k \{j},...,A i U{j},...,A n ) at rate ^ m jif . 

This dynamic of partitions of the set I can be immediately generalized to a dynamics of tuples 
of sets which are either disjoint or repetition of another subset. (Note the union can be strictly 
contained in the type set). Given a k disjoint subsets with union {1,...,M} the process from 
(I3.101[) induces a Markov jump process 

(3.104) (C m (i))t>o, 

with values in fc-tuples of disjoint possibly empty subsets with union {1, . . . , M}. Similarly we can 
start with the elements of any partition of {1, . . . , M}. 
Then we can prove 

Lemma 3.15 (The set-valued mutation dynamics and h-transform) 

(a) The Markov process (C m (^))t>o *s a pure Markov jump process with state space given by the 
set of partitions of {1, ... , M}. The partition 

(3.105) (l M \{M + 1}, 0) is a trap 

Assume that the mutation matrix is strictly positive for all types i,j€ Im\{M + 1}- Then starting 
with an initial partition, exactly one initial partition element will grow to Im\{M + 1} and all 
others will go to the empty set. 

(b) Given the initial partition ({1}, . . . , {M})) of (Im\{M + 1}) consider the process 

(3.106) (cr(i),---,crw), 

where the {Cr'Wi ' = 1; ' ' ' > -^0 are possibly empty disjoint subsets with union Im\{M + 1}). The 
quantity Ct™(*) denotes the set of initial types that have mutated to type i. This is a Markov process. 
If the rrikj > for k, I £ {1, . . . , M}, then for every k S N, 1 < j < k the k-tuples 

(3.107) A* := (0, . . . , 0, l M \{M + 1}, 0, . . . , 0), with l M \{M + 1} at j-th position 

are absorbing points for the the process and with probability one the process reaches an absorbing 
point at a finite time. 

(c) Given 1 < j < k the process conditioned to hit the absorbing point A* is given by the 
h-transform corresponding to hj which is the Markov process with transition rates 

(3.108) ? £ 6 = M|L Cl)C , z/Z^CO^O, 
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where 

(3.109) . . .,-B fe )) = P(B 1) ..., Bk )({C ^j* eventually}). 

Denote the law of the h-transformed process by P h . Then the law of £ m is given by 

(3.110) J2 h J ph3 - D 

Proof of Lemma 13.151 

(a) Note first that in a mutation transition two types i,j are involved and hence it suffices to 
consider the effect for a set A and A c . Consider therefore the initial condition 

(3.111) HO) = (A, A c ), A c I M \{M + 1}. 

Then a mutation from type i to type j makes no change if i,j G A or i,j G A c . However if 
i G A, j G A c , then after the transition at time r, 

(3.112) C(t+) = (A\i,A c Ui). 

Therefore after each mutation, we have a partition therefore resulting in a partition- valued Markov 
process. 

The process continues until it reaches 

(3.113) C = (0, Ui\{M + 1}) or C m - (Im\{M + 1}, 0). 

Therefore given any partition exactly one partition elements grows to Im\{M +1} and all the 
other go to the empty set since we have assumed positive rates on Ij\/\{A/ + 1}. 

We can next consider the enriched process where we start with a complete partition of 1m- 
When type i changes to type j, i is moved to the partition element containing j. This process 
continues until there is only one non-empty partition element (these are the traps) . Note that the 
number of different non-empty partition elements is non-increasing and the process eventually hits 
a trap with probability one. 

(b) and (c) Relations (|3.108[) and (|3.110[) follow as a special case of Doob's h-transform of a 
Markov process ( [RWj . III. 45). q.c.d. 

If we now consider j-column. the mutation acts independently on each of these components. 
Therefore we have now all what we need to handle the mutation part of the evolution of the 
set-valued tableau. 

(2) Selection. Given a factors k, the action of selection on this factor produces a new rank 
(birth) placed at k + 1 and the rank k + I is moved to rank k + £ + 1 for £ = 1,2,.... 

Now consider the action of a selection operator on an indicator function at a factor at a site. 
Recall that selection acts on each factors with rate s and when this occurs the selection operators 
corresponding to Ai, i = 2, . . . , Af, is chosen with probabilities (e^ — e^_i) where {e{\ are defined 
in (12-701 . 

Then the action of the chosen selection operators lA e act by multiplication on the active ele- 
ments of the factor and intersects with A c t on the new elements of the factor as specified by (|3.35p , 
()3.36p . On the level of a set- valued dynamics, if B = U-ga H^ =1 Bi^ G 27\ then selection \a<, acting 
on factor j G {1, . . . , n} produces a transition to the disjoint union of ordered products 

m / j—l n 

(3.H4) ( u n B ^ x ( b u ni «) xix n b ^ 

yi=\ \k = l k = j + l 

(m / j—1 n 
u n^ x ^ x n^] ] 
i=i \k=i k=j 
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We can represent such a subset in a more transparent way in a reduced tableau consisting of subsets 
of I as entries, where the columns form a tuple of subsets of I taken from a partition of I and the 
rows define disjoint nonempty subsets of I rl+1 . 

This means we can define uniquely the evolution of a column of a tableau since the indicators 
of sets, say A, B, in a column satisfy either A n B = or A = B. Given a specified rank in a 
tableau at a site the above mechanism induces the evolution of the column at this rank where of 
course in the case of repetition of sets they follow all the same evolution. 

Remark 41 Note that the corresponding tableau can be rewritten taking advantage of the simpli- 
fications 1a 4 + Ia c = I, IaJ-C = if A and C are disjoint, which may reduce the number of rows, 
but of course not the number of columns. 

Remark 42 Consider the result of two selection operations acting on the same rank - we obtain 
four summands 

(3.115) (/Uns) ® 1 ® 1, (1-1b)®W®1, (1 - 1 B ) ® (1 - 1a) ® /■ 

corresponding to the decomposition of Im ■ If f = li the decomposition is given by: 

(A n B) U (A c DB)U (B c n A) U (B c n A c ). 

The further evolution under mutation then determines which of the four decomposition elements 
takes over, that is, which of these will grow to 1m\{M+1}. For general f we get more decomposition 
elements. 

(3) Combined effect of selection and mutation. The effect of selection is to produce 
new nontrivial factors (real subsets of (I). Since mutation can change factors, it can change such 
a factor into I or - when this happens the corresponding particle dies. We can then think of 
the selection process as a "proposal" process and then mutation is acceptance-rejection process as 
follows (in the case M = 2) 

{1,2} — ► {2}x{l,2,3}U{2}x{l,2,3} 
[ ' — > {1,2}x{1,2,3}U{1,2}x{1,2}. 

In the first case the new rank produced by selection if removed (rejected) by mutation. 

(4) Coalescence. Factor £ and £' > £ which are at the same site coalesce by a look-down 
process: the rank £' is removed and the factor £ is modified by taking the intersections of the 
corresponding partition elements - see (|3.37p . (As a result in the representation as irreducible 
tableau some rows and columns can disappear. For example, in the case M = 2 the row with (100) 
in the £'th rank and (010) in the £' th rank is removed on coalescence so that one row and one 
column is removed.) 

(5) Set-valued Markovian dynamic. Now we have defined all three mechanisms occuring 
in a single site model and we can combine this now to a set- valued evolution in T. We denote the 
corresponding set-valued dynamics (respectively partition- valued) combining mutation (rate m), 
selection (rate s), coalescence (rate d) as 

(3.117) (C t m ' s ' d )t>o,(C' s4 )t>o. 

The corresponding jump rates are denoted 

(3.118) { q f; c s ; d } c , C er- 
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By construction the evolution is corresponding to the evolution of a set-valued tableau. Starting 
with a single factor or finite number of factors, for example (1, . . . , l,0)® fe , this determines the 
evolution at the given site in our indicator-function-tableau- valued process t/ ++,< . This proves in 
particular Proposition 13 . 1 31 for the case of a geographic space with one single site. 

The set-valued process has the following properties. 

Lemma 3.16 (Local set-valued dynamics with no migration) 

(a) Consider the Markov process (C t m,s ' )t>o with d > 0, m > 0. This process is positive recurrent 
and returns to states A C I N where at most the first factor is different from I. 

(b ) In particular for our concrete model of M lower and one upper type the process returns to the 
state (1, 1, ... , 1, 0) (corresponding to (Im\{M + 1}) x I N j after a recurrence time called the site 
resolution time r which has a distribution satisfying for some A > and k the initial number of 
variables 

(3.119) E k [e XT ]<oo. □ 

Proof of Lemma 13. 161 We first note that the number of active ranks at a site undergoes a birth 
and death process with linear birth rate and quadratic death rate. Therefore the process will reach 
a state with only one active rank with probability one. But then there is a positive probability 
that the mutation process at this rank will resolve, that is, reach a trap, before the next selection 
event. 

To prove the existence of a finite exponential moment, note that due to the quadratic death 
term there exists ko such that for k > ko the birth and death rates are dominated by a subcritical 
linear birth and death process. Therefore if Tfc _i is the time to hit ko — 1, then there exists some 
A' > such that for k > ko, 

(3.120) J B fc [e A ' Tfc f- 1 ] < oo. 

We also note that (by recurrence) for each k <ko, there is a positive probability of hitting 1 before 
k + 1. This means that the process returns to k at most a finite number of times before hitting 1 
and this random number has a geometric distribution. Therefore starting at k < ko the number of 
jumps before hitting 1 is the sum of a finite number of geometric random variables. Since the time 
intervals between jumps are exponentially distributed, this implies that t± has a finite exponential 
moment. Noting that when the number of active ranks is reduced to 1 this corresponds to a column 
containing members of a partition of I\{M + 1} and therefore has union some A C I\{M + 1}. If 
this equals I\{M + 1} then r = n. Otherwise there is a positive probability that this rank will 
reach a trap I\{M + 1} or before the next selection event. This will then occur after a geometric 
number of trials. This implies the result, q.c.d. 



3.7 Set-valued dual with migration 

We have sofar shown that we have for the non-spatial process a set-valued dual process with a 
Markovian dynamic. We now turn to the spatial case, when migration can occur. A transition due 
to mutation, selection and coalescence only effects the site at which the transition event occurs. 
Moreover if factors between sites, then this property is preserved under these transitions. 
Recall that we denote by |<? t ++ | the number of sites with a factor of Qf + , see (|3.97[) for a formal 
definition. 

We now introduce the transition due to migration which is a transition changing both the 
originating (parent) site and the site to which the migrant individual moves. Recall that in Q ++,< 
this amounts to a column changing its mark from say i to j corresponding to a migration step from 
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i to j. More specifically at rate ca(i,j) a migration event from a specific factor at an occupied site 
i to a site j takes place. 

There are two possibilities on migration at a time t, namely in the first case j is an unoccupied 
site thus yielding (1) \Q^ + \ — \Qt-\ + 1) or 3 1S an occupied site in which case we have (2) 

\Qt + \ = \Gt + \- 

In case (1) the factor is removed at site i and all factors to the right are shifted one to the left 
and at site j this migrant is placed as the first factor. 
In case (2) we distinguish two subcases. 

If prior to the migration the state of <7 t ++ factored at these two involved sites, then at the new 
site we add the individual at the first inactive rank at the new site and have the product form of 
the new individual and the old state at the new site. 

If Qt + docs not factor at the site where the migrant appears, the situation is more complicated. 
This case will not be dealt with here but is developed in [DGselj . 

Altogether the transition for the set-valued process is well-defined by these finite rates and 
well-defined transitions. 

We denote the process and the resulting transition rates as (with E denoting the state space): 

(3.121) (Ct h>o, U C x> rCC'tE- 

Proof of Proposition 13.131 (Markov property) 

We have seen that the transitions that occur in the function-valued dual Gt~ + (restricted to 
indicator functions) translate into the transitions on the corresponding set. Given any set G G I, 
the set of possible transitions G — > G due to selection, mutation, coalescence and migration are 
well defined and the transition rate is finite. Therefore these transitions define a continuous time 
Markov chain on the countable space I. Therefore the functional of Q ++,< and the process 
Q ++ have the same law. qed 

3.8 Calculating with § ++ 

We first note that to carry out actual calculations it is often convenient to represent sets by their 
indicators. For example (|3.116[) then reads: 

(3.122) (110) -4 (010) ® 1 + (010)® -> (110) or (110) ® (110). 

Also we can then work with irreducible tableaus of indicator factors to represent subsets, see p. 1141) 
simple. 

We demonstrate this at some examples. However one should have in mind that the objects we 
deal with should be viewed as subsets. The art of using the duality is to use the right representation 
at the right moment. 

In order to make the effect of migration and its interplay with mutation more transparent, 
we give some examples for the main effect that new occupied sites are created by migration, but 
as a result of mutation the parent site the offspring site can be deleted at a random time or be 
permanent. 

Example 3 The effects of migration for Q ++ 

We consider the case where we have two typs of low fitness and one of higher fitness. Let us 
start with (110) and assume that 4 births occur due to selection acting at the ranks 1,2,3,4 and 
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assume that we simplify the tableau by carrying out as much sums as possible (so that each time 
here only one row is added), so we get: 



(3.123) 



/ (010)! 

(100)! (010)i 

(100)i (100)! (010)i 

(100)i (100)! (100)i 

V (100)i (100)i (100)i 



(010)i 
(100)i 



(H0)i / 



Here the missing entries are (111). 

If the individual at the third (local) rank below migrates to an empty site and then there are 
two selection operations at the new site (denoted by subscript 2) we obtain (representing the a set 
by its indicator function with each row given by a product of indicator functions and the different 
rows being the indicator function of disjoint sets): 



(3.124) 



/ (010)i 
' (100)i 
(100)i 
(100)i 
(100)i 



V 



(010)i 
(100)i 
(100)i 
(100)i 



(010) 2 
(100) 2 
(100) 2 



(010)i 
(100)i 



o 



(110)i / 



(010)2 
(100)2 
(100)2 



(010)2 
(100)2 



(010)2 



Here indicates that the two sites are coupled. This corresponds to the multisite tableau 



(3.125) 



If a mutation 1 — > 2 occurs in either column 1 or column 2, then the new site is removed, i.e. 
becomes (111) and therefore inactive. On the other hand if the mutation 2 -) 1 occurs in both 
column 1 and column 2, and the last two columns coalesce with one of the first two columns, then 
the new site is active and is decoupled from the parent site. 

One additional complication arises if one of the last two columns migrates before this happens. 
Then the future of this migrant site depends on the site 2 in the same way. 

(3.126) 



(010)i 










(100)i 


(010)i 








(100)i 


(100)i 


(010) 2 






(100)i 


(100)i 


(100) 2 


(010)2 




(100)i 


(100)i 


(100) 2 


(100)2 


(010)2 


(100)1 


(100)i 


(100) 2 


(100)2 


(100)2 (010)1 


(100)i 


(100)i 


(100) 2 


(100)2 


(100)2 (100)1 



/ (010)i 

(100)i (010)i 

(100)i (100)i 

(100)i (100)i 

\ (100)i (100)i 



(010)2 
(100)2 
(100)2 



(010)3 
(100)3 



o 



(110)i / 



(010)2 
(100)2 
(100)2 



(010)2 
(100)2 



(010)2 



o 



(010)3 
(100)3 



(010)i 










(100)i 


(010)i 








(100)i 


(100)i 


(010) a 






(100)i 


(100)i 


(100) 2 


(010)2 




(100)i 


(100)i 


(100) 2 


(100)2 


(010)2 


(100)i 


(100)i 


(100) 2 


(100)2 


(100)2 (010)3 


(100)1 


(100)i 


(100)2 


(100)2 


(100)2 (100)3 



If the mutation 2 — >• 1 occurs in both column 1 and column 2, and the last two columns coalesce 
with one of the first two columns, then the new first site is active. Then if the mutation 1 — > 2 
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occurs in the first column at the second site, then the third site is removed. On the other hand if 
mutations 1 — > 2 occur at all three columns of the second site then the third site remains active 
and is decoupled. 

Note that the decomposition of the indicator function to the sum of products of indicator 
function is preserved but the constraint on which other individuals the migrating individual can 
coalesce with changes. 



4 Application: Ergodic theorem 



In this section we demonstrate how to use the set- valued dual process to prove an ergodic theorem 
for the case 

(4.1) I = {1,...,M}. 

We shall also consider the meanfield process X^-modcl with geographic space {l,---,iV} and 
migration kernel a(i,j) = -h for i,j G {1, 2, ■ ■ • , iV} and the limiting model as N — > oo, the 
socallcd McKcan-Vlasov process. (For more on this process see ( |DGselj ) with marginal law at 
time t denoted £ t .) 

Theorem 4 (Ergodic theorem for M-type system) 
We consider the type space {1, . . . , M}. 

(a) Let N < oo and consider the exchangeably interacting system 
(A2) X? = (x N (l,t),...,x N (N,t)) G (A H ./, 

^ ' ' with c > 0, d > and m y > for all (i,j) G {1, . . . , M}. 

Consider the distribution of £ tagged sites £[(x (l,t), . . . ,x. N (£,t))]. Then for £ < N 

(4.3) £[(x N (l,i),...,x w (^))]^ M f/G7>((A M -i) £ ) ast^oo, 

where /U^' is an equilibrium state and is independent of G (Am-i) n ■ 

(b) Consider the McKean- Vlasov dynamic (Ct)t>o corresponding to above set-up on I = {1, ... , M}. 
Then 

(4.4) Ct =>■ Coo = Vfq ,ast^oo, 

where /i^ G P(Am-i) does not depend on Co and is the marginal of the unique equilibrium of the 
McKean- Vlasov process. 

(c) Now consider the £-dimensional marginal of the equilibrium measure P^ N , {fi^ q ' e }N£N from 
Part (a). Then for every £ G N, {£ < N), as N -5- oo, 

I 

(4.5) ^/(d Xl , . . .,dx t ) => H(»? g (dxi)), 

i=l 

where /x^ is the stationary distribution given in D 

(d) Consider the system X N (t) := {xf((;, t)}je{i,...,JW},^enjv with d > 0. under the above as- 
sumptions on the {m(i,j)} and Cj > for all j. Assume we have a spatially homogeneous and 
shift ergodic initial condition. 

In this case 

(4.6) C[X N (t)} C%» , ast^ oo, 



where CJ^ 1 G P((Am-i) ) is the law of a spatially homogeneous shift-ergodic random field, 
which is an invariant measure of the evolution and which is unique under all translation-invariant 
measures. 
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Proof of Theorem [4] 

(a) It suffices that the joint moments of the field {x N (j,t), j = 1, . . . , N} converge. To show 
this we use the set- valued dual Qf + ' N . It suffices to take the initial state for the dual given by 

£ rij 

(4.7) §+ + ' n = n n i ^ a ^ s 

3=1 i=l 

where 1 < • Tij = n* < oo where ti* is independent of N. This means at a collection of sites 
j G {1, • • ■ , £} we have rij factors. 
Case i: d = 0,N= 1. 

In this deterministic case it suffices to consider only n* = 1 since different variables in the dual 
do not interact by coalescence and hence evolve independently so that the product structure is 
preserved. 

First consider the case M = 2 where type 2 has fitness 1 and type 1 fitness 0. 

Start the dual with (01). (Note that this suffices in this case). Consider the tableau which 
results from three selection events before the first mutation event (here the special form results in 
the fact that selection creates only four nonzero rows): 



(4.8) 



(01) 


®1 


01 


551 


(10) 


®(01) 


01 


(8)1 


(10) 


®(10) 


0(01) 


<8)1 


(10) 


®(10) 


0(10) 


<g>(01) 



Observe next that the mutation 2 — > 1 replaces (01) by (00) and this removes one row and column. 
However if there is a single row, that is, the tableau (01), then the result is a trap and the value is 
0. On the other hand a mutation 1 — > 2 replaces (01) by (11) and this immediately produces the 
value 1. To see this note that the first 771.1,2 mutation terminates the process since columns to the 
left of the column £ at which the mutation occurs form {(01)^ U (10)^}. 

We can then view the number of active rows and columns as a birth and death process with 
birth rate s and death rate 7712,1 and which is also terminated forever at rate mi 2- 

We must now consider two cases s > 7712,1 and s < 7112.1. 

In the latter case the birth and death process is recurrent and returns to (01) infinitely often 
if no mutation occurs on each return. Since on each such return there is a positive probability of 
reaching a trap on the next transition, the probability that the tableau has not reached a trap by 
time t goes to as t —> 00. 

On the other hand if s > 7712,1 then there are an increasing number of rows each containing a 
factor (01). The probability that the next transition at this factor produces a (11) and terminates 
the process is positive. So again the probability that the tableau has not reached a trap by time t 
goes to as t — >• 00. 

Hence combining the cases the process reaches the trap with probability tending to 1 as t — > 00. 
The value of the dual expressions is or 1 irrespective of the initial value of the process X . Hence 
we have convergence of moments to a value independently of the initial state. 

Now consider the case M > 2. 

Again if the d = each birth factor (created by selection) is eventually resolved to (1, . . . , 1) 
or (0, . . . , 0). Since the probability of the latter is strictly positive there are at most finitely many 
which are not (0, . . . , 0). Also the initial column will finally by mutation reach the (1, ■ ■ • , 1) or 
(0, • • • , 0). Therefore after a finite random time we again conclude that Gq + ' = or 1 . Denote 
the probabilities that the process reaches the trap (0, . . . , 0), (1, . . . , 1) by q^, q\ respectively. Then 

(4.9) lim E[xf{t)] = q{ independent of x w (0) for every i e E . 
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Case ii: d > 0, N = 1. 

In this case we must consider the initial condition (|4.7|) for £ = 1 and arbitrary n* . 

Again the selection process creates births (i.e. new columns and rows in the tableau). One new 
feature is that a selection operation can produce more than one new row since the number of rows 
doubles at each step in which IaJ <8> 1 or (1 — 1a) <£> / does not contain a 1a/ which is identically 
zero. 

Since d > 0, the process |£/ t ++ ' W |, the number of active columns is always positive recurrent. 
Each time the number of factors reduces to 1, there is a positive probability that the mutation 
process will hit a trap before the next selection event. Therefore Gq + ' N will reach a trap at a finite 
random time with probability 1 and then assume the value Qq + ' = or 1 (see Lemma 13.151 and 
Lemma [3. 16[) . Again denote the probabilities that the process reaches the trap (0, . . . , 0), (1, . . . , 1) 
by go, qi (note that qo = qo(£, ni, • • • , ne) similarly qi). Then 

e 

(4.10) lim E[T](x? (t)) n ' (t) l = qi independent of x N (0). 

i=i 



This again shows convergence of all moments and the claim follows. 



Remark 43 The mutation among the first (M — 1) types can kill these extra rows but not the 
primary row added by a selection event. We illustrate this as follows. Let selection act with 1a on 
the column number 1 with A = 3,4,5. Then starting from the tableau 

(00001) (8)1 (8)1 ®1 

1 (8)1 



( 4 " n ) (11100) ®(00001) 
we get the new tableau: 



(00001) ®1 ®1 <8>1 

, , (11000) ®(00001) (8>1 <8>1 

[ ' (00100) ®(00001) (8>1 <8>1 

(11000) <8>(11100) ®(00001) ®1. 

The transition (00001) — > (11111) no longer automatically terminates the process but does cut the 
tableau at the level of this factor. 

Case Hi: d > 0, l<A<oo. 

In this case we must consider initial conditions given by (|4.7[) with initial factors at £ different 
sites and arbitrary rt*. In this case the dynamics at each site are positive recurrent and the analysis 
at a site is as above. However the new aspect is that before resolution at a site, migration can 
produce new occupied sites. 

First consider the case M = 2. 

As above each site will eventually hit (11) or (00). If it resolves to (11) the resulting value is 
1. If it resolves to (00) and produces no migrants before resolution the value is 0. If it resolves to 
(00) during it lifetime it can produce a random number of migrant sites. Each of these can resolve 
to (00) or (11). If the second resolves to (11) the resulting value is 1. If it resolves to (00) before 
producing a migrant site, the value is 0, otherwise it can produce a random number of offspring. 
As a result we have a growing number of visited sites. Since each new site can either resolve to 
(11) or resolve to (00) before producing an offspring with some probability p„ > the process 
will terminate after a random number of trials. If one of the initial sites resolves to (00) before 
producing a migrant the resulting value if and if it resolves to (11) the value is 1. Hence again 
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the process reaches the traps where <? t is the full set or is empty. Hence as above we see that all 
moments converge again which gives the claim. 
Now consider the case M > 2. 

Here the principle is the same, the number of columns is a positive recurrent process returning 
to 1. Then the process hits (0, • • • , 0) or (1, • • • , 1) with positive probability by mutation before 
any selection event happens. Then eventually the set- valued dual will hit a trap with all factors I 
or the 0-set (see Lemma 13 . 151 and Lemma |3.16[) and all the mixed moments are computed as above 
giving convergence to a limit independently of the inital state. 

(b) Now consider the McKean-Vlasov process in which we now use the set-valued dual with 
S = N. The new feature is that the dual is described by a set of occupied sites and the corresponding 
CMJ process describing the cloud of occupied (non-resolved sites) can be supercritical. First of all 
all dual clouds starting from different sites will become independent as N — > oo Furthermore since 
each site has a positive probability of resolving, in a growing cloud of unresolved sites, eventually 
one of these will resolve to and kill the whole expression. 

Since the probability that this has not yet happened is monotone decreasing and hence all 
moments converge to a limit which proves (|4.4I) . 

(c) In order to prove the convergence fix £ and consider (|4.7|) . As before Q ++,N will hit a trap 
at a finite random time. The point is then that as N — > oo the time to hit the trap occurs before 
any collisions occur tends to 1 and hence for very large N the calculation is essentially the same 
as for the case N = oo, that is, the McKean-Vlasov case which identifies the factor in (|4.5[) as /Lt^j. 

(d) Now consider the ergodic theorem for the M-type interacting system on the hierarchical 
group fijv in the case rriij > for all i, j £ {1, . . . , M}. We must now prove the convergence of 
the joint moments. To do this we start factors at each of £ different sites. 

The main point is that the dual system eventually hits a trap and as before in b) this will 
guarantee convergence of all moments. The spatial homogeneity follows from the one of the initial 
state if this has this property, but since all initial states lead to the same limit we are done. In 
fact we see that the system with collisions will tend to hit a trap faster that the one for the 
McKean-Vlasov dual. 

Furthermore we note that two initial clouds of factors starting at two sites which are at hierar- 
chical distance distance £ will have a probability to ever meet before they resolve to or 1 which 
tends to as £ — > oo. 

More precisely, if A is the parameter in the exponential tail of the trapping time, the probability 
of a collision between two clouds starting at two sites at hierarchical distance £ before trapping is 



(4.13) 



< const ■ 



A + 



N 2 



Therefore the all the mixed moments connected with two sites at distance £ factor asymptotically 
as I — > oo. Hence the limiting state is shift-ergodic. ■ 
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Index of Notation 

• Z - the set of integers 
. N = {1,2,3,...} 

• f2jv = ®^Zn , Zn cyclical group of order N 

• M.(E) denotes the space of finite Borel measures on a Polish space E 

• V(E) denotes the space of probability measures on the Borel field on a Polish space E. 



